SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 601650 of 1209 papers

TitleStatusHype
SAML-QC: a Stochastic Assessment and Machine Learning based QC technique for Industrial Printing0
SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition0
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents0
Scaling Automatic Extraction of Pseudocode0
Scatteract: Automated extraction of data from scatter plots0
SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering0
Scene Text recognition with Full Normalization0
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild0
SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings0
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models0
Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis0
See then Tell: Enhancing Key Information Extraction with Vision Grounding0
SEE: Towards Semi-SupervisedEnd-to-End Scene Text Recognition0
Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification0
Self-paced learning to improve text row detection in historical documents with missing labels0
Self-supervised Data Bootstrapping for Deep Optical Character Recognition of Identity Documents0
Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules0
Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images0
Semi-automated annotation of page-based documents within the Genre and Multimodality framework0
Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching0
Sequence-to-Label Script Identification for Multilingual OCR0
Sequence to Sequence Learning for Optical Character Recognition0
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding0
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI0
Similar Document Template Matching Algorithm0
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps0
Simple Transparent Adversarial Examples0
Simulation d’erreurs d’OCR dans les systèmes de TAL pour le traitement de données anachroniques (Simulation of OCR errors in NLP systems for processing anachronistic data)0
Sinica-IASL Chinese spelling check system at Sighan-70
SIS@IIITH at SemEval-2020 Task 8: An Overview of Simple Text Classification Methods for Meme Analysis0
Slide2Text: Leveraging LLMs for Personalized Textbook Generation from PowerPoint Presentations0
Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 20230
Solving Substitution Ciphers with Combined Language Models0
Southern Newswire Corpus: A Large-Scale Dataset of Mid-Century Wire Articles Beyond the Front Page0
SPARLING: Learning Latent Representations with Extremely Sparse Activations0
Sparse Concept Coded Tetrolet Transform for Unconstrained Odia Character Recognition0
SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check0
Squibs: Spelling Error Patterns in Brazilian Portuguese0
Star-net: A spatial attention residue network for scene text recognition.0
Statistical Learning for OCR Text Correction0
Machine Learning Construction: implications to cybersecurity0
Statistical Machine Translation Improvement based on Phrase Selection0
Still not there? Comparing Traditional Sequence-to-Sequence Models to Encoder-Decoder Neural Networks on Monotone String Translation Tasks0
STRIDE : Scene Text Recognition In-Device0
Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers0
Sum-Product Networks for Sequence Labeling0
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning0
SuperOCR for ALTA 2017 Shared Task0
Survey of Computational Approaches to Lexical Semantic Change0
SVDocNet: Spatially Variant U-Net for Blind Document Deblurring0
Show:102550
← PrevPage 13 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy (%)89.6Unverified
2DTrOCRAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified