SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 751775 of 1209 papers

TitleStatusHype
A Simple and Practical Approach to Improve Misspellings in OCR Text0
An End-to-End Khmer Optical Character Recognition using Sequence-to-Sequence with Attention0
Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences0
Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship0
Scene Text Telescope: Text-Focused Scene Image Super-ResolutionCode0
Mixed Model OCR Training on Historical Latin Script for Out-of-the-Box Recognition and Finetuning0
Classification of Documents Extracted from Images with Optical Character Recognition Methods0
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition0
Classification of Contract-Amendment Relationships0
PAM: Understanding Product Images in Cross Product Category Attribute Extraction0
Toward Creation of Ancash Lexical Resources from OCR0
Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods0
A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators0
Empirical Error Modeling Improves Robustness of Noisy Neural Sequence LabelingCode0
Simple Transparent Adversarial Examples0
End-to-End Unsupervised Document Image Blind Denoising0
STRIDE : Scene Text Recognition In-Device0
Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition0
Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification and Active Learning0
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text0
GroupLink: An End-to-end Multitask Method for Word Grouping and Relation Extraction in Form Understanding0
An end-to-end Optical Character Recognition approach for ultra-low-resolution printed text images0
End-to-End Optical Character Recognition for Bengali Handwritten WordsCode0
Word-Level Alignment of Paper Documents with their Electronic Full-Text CounterpartsCode0
Analyzing Green View Index and Green View Index best path using Google Street View and deep learningCode0
Show:102550
← PrevPage 31 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified