SOTAVerified

Optical Character Recognition

Papers

Showing 251-300 of 526 papers

| Title | Status | Hype |
|---|---|---|
| Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR | | 0 |
| POINTS1.5: Building a Vision-Language Model towards Real World Applications | | 0 |
| POINTS: Improving Your Vision-language Model with Affordable Strategies | | 0 |
| PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System | | 0 |
| Producing Corpora of Medieval and Premodern Occitan | | 0 |
| Proposal for Automatic License and Number Plate Recognition System for Vehicle Identification | | 0 |
| Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition | | 0 |
| Quantitative Analysis of Image Classification Techniques for Memory-Constrained Devices | | 0 |
| RDU: A Region-based Approach to Form-style Document Understanding | | 0 |
| Reading in the Dark with Foveated Event Vision | | 0 |
| Real-time information retrieval from Identity cards | | 0 |
| Recognition of Text Image Using Multilayer Perceptron | | 0 |
| Recommending Scientific Videos based on Metadata Enrichment using Linked Open Data | | 0 |
| Recursive Recurrent Nets with Attention Modeling for OCR in the Wild | | 0 |
| Regularization and Kernelization of the Maximin Correlation Approach | | 0 |
| Representing Online Handwriting for Recognition in Large Vision-Language Models | | 0 |
| Reranking with Linguistic and Semantic Features for Arabic Optical Character Recognition | | 0 |
| Resilience of Large Language Models for Noisy Instructions | | 0 |
| Resolving Referring Expressions in Images With Labeled Elements | | 0 |
| Resource Constrained Structured Prediction | | 0 |
| Resume Information Extraction via Post-OCR Text Processing | | 0 |
| Revisiting Multi-Modal LLM Evaluation | | 0 |
| Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking | | 0 |
| Rosetta: Large scale system for text detection and recognition in images | | 0 |
| SAML-QC: a Stochastic Assessment and Machine Learning based QC technique for Industrial Printing | | 0 |
| SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition | | 0 |
| Scaling Automatic Extraction of Pseudocode | | 0 |
| Scatteract: Automated extraction of data from scatter plots | | 0 |
| SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering | | 0 |
| SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | | 0 |
| Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis | | 0 |
| See then Tell: Enhancing Key Information Extraction with Vision Grounding | | 0 |
| Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification | | 0 |
| Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images | | 0 |
| Self-paced learning to improve text row detection in historical documents with missing labels | | 0 |
| Self-supervised Data Bootstrapping for Deep Optical Character Recognition of Identity Documents | | 0 |
| Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules | | 0 |
| Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching | | 0 |
| Sequence to Sequence Learning for Optical Character Recognition | | 0 |
| Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps | | 0 |
| Simple Transparent Adversarial Examples | | 0 |
| Statistical Learning for OCR Text Correction | | 0 |
| Machine Learning Construction: implications to cybersecurity | | 0 |
| STRIDE : Scene Text Recognition In-Device | | 0 |
| Sum-Product Networks for Sequence Labeling | | 0 |
| SuperOCR: A Conversion from Optical Character Recognition to Image Captioning | | 0 |
| SVDocNet: Spatially Variant U-Net for Blind Document Deblurring | | 0 |
| Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks | | 0 |
| Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences | | 0 |
| TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models | | 0 |

No leaderboard results yet.