SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 901950 of 1209 papers

TitleStatusHype
Neural Monkey: The Current State and Beyond0
News Deja Vu: Connecting Past and Present with Semantic Search0
N-gram language models for massively parallel devices0
Nonparametric modeling cash flows of insurance company0
NOSE Augment: Fast and Effective Data Augmentation Without Searching0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
Notes on Applicability of GPT-4 to Document Understanding0
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts0
NVLM: Open Frontier-Class Multimodal LLMs0
Object-Centric Representations Improve Policy Generalization in Robot Manipulation0
Object Detection and Recognition of Swap-Bodies using Camera mounted on a Vehicle0
OCR4all -- An Open-Source Tool Providing a (Semi-)Automatic OCR Workflow for Historical Printings0
OCR accuracy improvement on document images through a novel pre-processing approach0
OCR and Automated Translation for the Navigation of non-English Handsets: A Feasibility Study with Arabic0
OCR and post-correction of historical Finnish texts0
OCRAPOSE II: An OCR-based indoor positioning system using mobile phone images0
OCR++: A Robust Framework For Information Extraction from Scholarly Articles0
OCR, Classification & Machine Translation (OCCAM)0
OCR Error Correction Using Character Correction and Feature-Based Word Classification0
OCR evaluation tools for the 21st century0
OCR for TIFF Compressed Document Images Directly in Compressed Domain Using Text segmentation and Hidden Markov Model0
OCR Graph Features for Manipulation Detection in Documents0
OCR Improves Machine Translation for Low-Resource Languages0
OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System0
OCR Language Models with Custom Vocabularies0
OCR of historical printings with an application to building diachronic corpora: A case study using the RIDGES herbal corpus0
OCR Post-Correction Evaluation of Early Dutch Books Online - Revisited0
OCR Post-Processing Text Correction using Simulated Annealing (OPTeCA)0
OCR Processing of Swedish Historical Newspapers Using Deep Hybrid CNN–LSTM Networks0
OCR quality affects perceived usefulness of historical newspaper clippings -- a user study0
OCR Quality and NLP Preprocessing0
OCR-RTPS: An OCR-based real-time positioning system for the valet parking0
OCR Synthetic Benchmark Dataset for Indic Languages0
OdiEnCorp 2.0: Odia-English Parallel Corpus for Machine Translation0
Offline Handwritten MODI Character Recognition Using HU, Zernike Moments and Zoning0
Old Content and Modern Tools - Searching Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-19100
Omnifont Persian OCR System Using Primitives0
On-Device Document Classification using multimodal features0
On-Device Language Identification of Text in Images using Diacritic Characters0
On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification0
On-Device Text Image Super Resolution0
One Filter to Deploy Them All: Robust Safety for Quadrupedal Navigation in Unknown Environments0
One RL to See Them All: Visual Triple Unified Reinforcement Learning0
Emergency-Brake Simplex: Toward A Verifiably Safe Control-CPS Architecture for Abrupt Runtime Reachability Constraint Changes0
On the Accuracy of CRNNs for Line-Based OCR: A Multi-Parameter Evaluation0
On the feasibility of attacking Thai LPR systems with adversarial examples0
Open data for Moroccan license plates for OCR applications : data collection, labeling, and model construction0
Open Philology at the University of Leipzig0
OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles0
Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss0
Show:102550
← PrevPage 19 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified