SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 601625 of 1209 papers

TitleStatusHype
Derivate-based Component-Trees for Multi-Channel Image Segmentation0
Design and Development of a Framework For Stroke-Based Handwritten Gujarati Font Generation0
Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices0
Detecting de minimis Code-Switching in Historical German Books0
D\'etection d'erreurs dans des transcriptions OCR de documents historiques par r\'eseaux de neurones r\'ecurrents multi-niveau (Combining character level and word level RNNs for post-OCR error detection)0
Detection Masking for Improved OCR on Noisy Documents0
Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition0
Recognition of Images of Korean Characters Using Embedded Networks0
Recognition of Text Image Using Multilayer Perceptron0
Recommending Scientific Videos based on Metadata Enrichment using Linked Open Data0
Reconnaissance d’entités nommées sur des sorties OCR bruitées : des pistes pour la désambiguïsation morphologique automatique (Resolution of entity linking issues on noisy OCR output : automatic disambiguation tracks)0
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild0
Reference-Based Post-OCR Processing with LLM for Diacritic Languages0
Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction0
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation0
Regularization and Kernelization of the Maximin Correlation Approach0
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training0
Representing Online Handwriting for Recognition in Large Vision-Language Models0
Reranking with Linguistic and Semantic Features for Arabic Optical Character Recognition0
Resilience of Large Language Models for Noisy Instructions0
Resolving Referring Expressions in Images With Labeled Elements0
Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition0
Resource Constrained Structured Prediction0
Resume Information Extraction via Post-OCR Text Processing0
Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge0
Show:102550
← PrevPage 25 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified