SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 626650 of 1209 papers

TitleStatusHype
Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching0
Identifying OCRs in cfDNA WGS Data by Correlation Clustering0
BLPnet: A new DNN model and Bengali OCR engine for Automatic License Plate Recognition0
Omnifont Persian OCR System Using Primitives0
DocBed: A Multi-Stage OCR Solution for Documents with Complex Layouts0
Self-paced learning to improve text row detection in historical documents with missing labels0
An Assessment of the Impact of OCR Noise on Language Models0
A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit Recognition0
Classroom Slide Narration System0
Legal Entity Extraction using a Pointer Generator Network0
Improve Sentence Alignment by Divide-and-conquer0
On the Cross-dataset Generalization in License Plate RecognitionCode1
SAFL: A Self-Attention Scene Text Recognizer with Focal LossCode0
Intelligent Document Processing -- Methods and Tools in the real world0
LaTr: Layout-Aware Transformer for Scene-Text VQACode1
Challenging America: Modeling language in longer time scales0
Lesan -- Machine Translation for Low Resource Languages0
Tracing Text Provenance via Context-Aware Lexical Substitution0
Modelling Lips-State Detection Using CNN for Non-Verbal Communications0
A Survey on Deep learning based Document Image Enhancement0
An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map ImagesCode1
Transferring Modern Named Entity Recognition to the Historical Domain: How to Take the Step?0
On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification0
OCR-free Document Understanding TransformerCode3
Image preprocessing and modified adaptive thresholding for improving OCR0
Show:102550
← PrevPage 26 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified