SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 11011150 of 1209 papers

TitleStatusHype
Crowdsourcing an OCR Gold Standard for a German and French Heritage Corpus0
Measuring Lexical Quality of a Historical Finnish Newspaper Collection ― Analysis of Garbled OCR Data with Basic Language Technology Tools and Means0
Training \& Quality Assessment of an Optical Character Recognition Model for Northern Haida0
Extracting Weighted Language Lexicons from Wikipedia0
OCR Post-Correction Evaluation of Early Dutch Books Online - Revisited0
1 Million Captioned Dutch Newspaper Images0
OCR Error Correction Using Character Correction and Feature-Based Word Classification0
Overlay Text Extraction From TV News Broadcast0
Robust Scene Text Recognition with Automatic RectificationCode0
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild0
Resource Constrained Structured Prediction0
Data Cleaning for XML Electronic Dictionaries via Statistical Anomaly Detection0
Improving patch-based scene text script identification with ensembles of conjoined networksCode0
Font Identification in Historical Documents Using Active Learning0
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural ImagesCode0
Decoding Anagrammed Texts Written in an Unknown Language and Script0
Finding Names in Trove: Named Entity Recognition for Australian Historical Newspapers0
Comparison of Visual and Logical Character Segmentation in Tesseract OCR Language Data for Indic Writing Scripts0
Calibrated Structured PredictionCode0
Sequence to Sequence Learning for Optical Character Recognition0
Directional Global Three-part Image Decomposition0
Telugu OCR Framework using Deep Learning0
OCR accuracy improvement on document images through a novel pre-processing approach0
Is it possible to recover personal health information from an automatically de-identified corpus of French EHRs?0
DanProof: Pedagogical Spell and Grammar Checking for Danish0
Statistical Machine Translation Improvement based on Phrase Selection0
Topic Stability over Noisy Sources0
A preliminary study on similarity-preserving digital book identifiers0
SAHSOH@QALB-2015 Shared Task: A Rule-Based Correction Method of Common Arabic Native and Non-Native Speakers' Errors0
TECHLIMED@QALB-Shared Task 2015: a hybrid Arabic Error Correction System0
A Linked Data Model for Multimodal Sentiment and Emotion Analysis0
License Plate Recognition System Based on Color Coding Of License Plates0
Boosting Optical Character Recognition: A Super-Resolution Approach0
Automated Translation of a Literary Work: A Pilot Study0
Unsupervised Code-Switching for Multilingual Historical Document Transcription0
Squibs: Spelling Error Patterns in Brazilian Portuguese0
Regularization and Kernelization of the Maximin Correlation Approach0
A survey of modern optical character recognition techniques0
A Study of Sindhi Related and Arabic Script Adapted languages Recognition0
Learning Multiple Tasks in Parallel with a Shared Annotator0
Efficient Media Retrieval from Non-Cooperative Queries0
Optical Character Recognition, Using K-Nearest Neighbors0
OCR and Automated Translation for the Navigation of non-English Handsets: A Feasibility Study with Arabic0
A random forest system combination approach for error detection in digital dictionaries0
Improve CAPTCHA's Security Using Gaussian Blur Filter0
Autocorrection of arabic common errors for large text corpus0
Balanced Korean Word Spacing with Structural SVM0
TECHLIMED system description for the Shared Task on Automatic Arabic Error Correction0
CMUQ@QALB-2014: An SMT-based System for Automatic Arabic Error Correction0
Bypassing Captcha By Machine A Proof For Passing The Turing Test0
Show:102550
← PrevPage 23 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified