SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 951975 of 1209 papers

TitleStatusHype
Optical Character Recognition and Transcription of Berber Signs from Images in a Low-Resource Language Amazigh0
Optical Character Recognition (OCR) for Telugu: Database, Algorithm and Application0
Optical character recognition quality affects perceived usefulness of historical newspaper clippings0
Optical Character Recognition using Convolutional Neural Networks for Ashokan Brahmi Inscriptions0
Optical Character Recognition, Using K-Nearest Neighbors0
Optical Character Recognition, Word Segmentation, Sentence Segmentation, and Information Extraction for Historical and Literature Texts in Classical Chinese0
Optical Text Recognition in Nepali and Bengali: A Transformer-based Approach0
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning0
Optimizing the Neural Network Training for OCR Error Correction of Historical Hebrew Texts0
OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst0
Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation0
Out-of-Distribution Recovery with Object-Centric Keypoint Inverse Policy for Visuomotor Imitation Learning0
Out-of-Vocabulary Challenge Report0
Overlay Text Extraction From TV News Broadcast0
Overview of the 2017 ALTA Shared Task: Correcting OCR Errors0
PACMAN: a framework for pulse oximeter digit detection and reading in a low-resource setting0
PaddleOCR 3.0 Technical Report0
Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features0
PAM: Understanding Product Images in Cross Product Category Attribute Extraction0
papago: A Machine Translation Service with Word Sense Disambiguation and Currency Conversion0
Pay Voice: Point of Sale Recognition for Visually Impaired People0
PDFdigest: an Adaptable Layout-Aware PDF-to-XML Textual Content Extractor for Scientific Articles0
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction0
PDF-to-Text Reanalysis for Linguistic Data Mining0
People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts0
Show:102550
← PrevPage 39 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy (%)89.6Unverified
2DTrOCRAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified