SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 876900 of 1209 papers

TitleStatusHype
Automated Quality Control System for Canned Tuna Production using Artificial Vision0
Automated Transcription of Non-Latin Script Periodicals: A Case Study in the Ottoman Turkish Print Archive0
Automated Translation of a Literary Work: A Pilot Study0
Automatic Classification of Pathology Reports using TF-IDF Features0
Automatic Compositor Attribution in the First Folio of Shakespeare0
Auto-ML Deep Learning for Rashi Scripts OCR0
Balanced Korean Word Spacing with Structural SVM0
Bambara and Maninka Manding Languages Written Corpora Project (``Projet des corpus \'ecrits des langues manding : le bambara, le maninka'') [in French]0
Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods0
Bangla Text Recognition from Video Sequence: A New Focus0
BART for Post-Correction of OCR Newspaper Text0
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology0
Benchmark for License Plate Character Segmentation0
Benchmarking Algorithms for Automatic License Plate Recognition0
Bengali Document Layout Analysis -- A YOLOV8 Based Ensembling Approach0
Bengali Document Layout Analysis with Detectron20
Bengali Handwritten Digit Recognition using CNN with Explainable AI0
BennettNLP at SemEval-2020 Task 8: Multimodal sentiment classification Using Hybrid Hierarchical Classifier0
Between History and Natural Language Processing: Study, Enrichment and Online Publication of French Parliamentary Debates of the Early Third Republic (1881-1899)0
Beyond Logit Lens: Contextual Embeddings for Robust Hallucination Detection & Grounding in VLMs0
Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing0
Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition0
BIOfid Dataset: Publishing a German Gold Standard for Named Entity Recognition in Historical Biodiversity Literature0
BLPnet: A new DNN model and Bengali OCR engine for Automatic License Plate Recognition0
Modelling Lips-State Detection Using CNN for Non-Verbal Communications0
Show:102550
← PrevPage 36 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified