SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 10511075 of 1209 papers

TitleStatusHype
A System for Identifying and Exploring Text Repetition in Large Historical Document Corpora0
Automatic Compositor Attribution in the First Folio of Shakespeare0
Scatteract: Automated extraction of data from scatter plots0
OCRAPOSE II: An OCR-based indoor positioning system using mobile phone images0
Attention-based Extraction of Structured Information from Street View ImageryCode0
Effective search space reduction for spell correction using character neural embeddings0
Important New Developments in Arabographic Optical Character Recognition (OCR)0
Content-based similar document image retrieval using fusion of CNN features0
A Holistic Approach for Optimizing DSP Block Utilization of a CNN implementation on FPGA0
Twitter100k: A Real-world Dataset for Weakly Supervised Cross-Media Retrieval0
Endangered Data for Endangered Languages: Digitizing Print dictionaries0
End-to-End Interpretation of the French Street Name Signs DatasetCode0
Language Independent Single Document Image Super-Resolution using CNN for improved recognition0
Document Decomposition of Bangla Printed Text0
LAREX - A semi-automatic open-source Tool for Layout Analysis and Region Extraction on Early Printed BooksCode0
Case Study of a highly automated Layout Analysis and OCR of an incunabulum: 'Der Heiligen Leben' (1488)Code0
Profiling of OCR'ed Historical Texts Revisited0
Mining Spatio-temporal Data on Industrialization from Historical RegistriesCode0
Recognition of Text Image Using Multilayer Perceptron0
papago: A Machine Translation Service with Word Sense Disambiguation and Currency Conversion0
Implementation of a Workflow Management System for Non-Expert Users0
Detection of Text Reuse in French Medical Corpora0
Integrating Optical Character Recognition and Machine Translation of Historical Documents0
Align Me: A framework to generate Parallel Corpus Using OCRs and Bilingual Dictionaries0
Providing and Analyzing NLP Terms for our Community0
Show:102550
← PrevPage 43 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified