SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 9761000 of 1209 papers

TitleStatusHype
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAsCode0
Sum-Product Networks for Sequence Labeling0
Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character RecognitionCode0
Multi-Input Attention for Unsupervised OCR Correction0
ASTER: An Attentional Scene Text Recognizer with Flexible RectificationCode0
Deductron -- A Recurrent Neural Network0
Recommending Scientific Videos based on Metadata Enrichment using Linked Open Data0
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text RecognitionCode0
Confidence Prediction for Lexicon-Free OCR0
Implicit Language Model in LSTM for OCRCode0
IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection0
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles0
PDF-to-Text Reanalysis for Linguistic Data Mining0
Building a Corpus from Handwritten Picture Postcards: Transcription, Annotation and Part-of-Speech Tagging0
TF-LM: TensorFlow-based Language Modeling ToolkitCode0
Towards Processing of the Oral History Interviews and Related Printed Documents0
Building A Handwritten Cuneiform Character Imageset0
PDFAnno: a Web-based Linguistic Annotation Tool for PDF DocumentsCode0
Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods0
D\'etection d'erreurs dans des transcriptions OCR de documents historiques par r\'eseaux de neurones r\'ecurrents multi-niveau (Combining character level and word level RNNs for post-OCR error detection)0
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections0
Measuring Innovation in Speech and Language Processing Publications.0
Matics Software Suite: New Tools for Evaluation and Data Exploration0
Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation0
Correction of OCR Word Segmentation Errors in Articles from the ACL Collection through Neural Machine Translation Methods0
Show:102550
← PrevPage 40 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified