SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 601625 of 1209 papers

TitleStatusHype
SAML-QC: a Stochastic Assessment and Machine Learning based QC technique for Industrial Printing0
SARD: A Large-Scale Synthetic Arabic OCR Dataset for Book-Style Text Recognition0
Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents0
Scaling Automatic Extraction of Pseudocode0
Scatteract: Automated extraction of data from scatter plots0
SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering0
Scene Text recognition with Full Normalization0
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild0
SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings0
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models0
Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis0
See then Tell: Enhancing Key Information Extraction with Vision Grounding0
SEE: Towards Semi-SupervisedEnd-to-End Scene Text Recognition0
Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification0
Self-paced learning to improve text row detection in historical documents with missing labels0
Self-supervised Data Bootstrapping for Deep Optical Character Recognition of Identity Documents0
Semantic rule Web-based Diagnosis and Treatment of Vector-Borne Diseases using SWRL rules0
Semantic Segmentation for Real-World and Synthetic Vehicle's Forward-Facing Camera Images0
Semi-automated annotation of page-based documents within the Genre and Multimodality framework0
Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching0
Sequence-to-Label Script Identification for Multilingual OCR0
Sequence to Sequence Learning for Optical Character Recognition0
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding0
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI0
Similar Document Template Matching Algorithm0
Show:102550
← PrevPage 25 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy (%)89.6Unverified
2DTrOCRAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified