SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 776800 of 1209 papers

TitleStatusHype
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning0
On-Device Text Image Super Resolution0
Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts0
On-Device Language Identification of Text in Images using Diacritic Characters0
OCR Post Correction for Endangered Language TextsCode1
Automated data extraction of bar chart raster images0
An Unsupervised method for OCR Post-Correction and Spelling Normalisation for FinnishCode1
Handwriting Classification for the Analysis of Art-Historical DocumentsCode0
Automated Transcription of Non-Latin Script Periodicals: A Case Study in the Ottoman Turkish Print Archive0
OCR, Classification & Machine Translation (OCCAM)0
Chunk-based Chinese Spelling Check with Global Optimization0
Alleviating Digitization Errors in Named Entity Recognition for Historical DocumentsCode0
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question AnsweringCode1
Persian Handwritten Digit, Character and Word Recognition Using Deep Learning0
TLGAN: document Text Localization using Generative Adversarial NetsCode1
Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution0
DE-GAN: A Conditional Generative Adversarial Network for Document EnhancementCode1
A Conglomerate of Multiple OCR Table Detection and Extraction0
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding0
Tokenization Repair in the Presence of Spelling ErrorsCode1
Table Structure Recognition using Top-Down and Bottom-Up CuesCode1
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering0
A Large Multi-Target Dataset of Common Bengali Handwritten GraphemesCode1
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition0
Towards Image-based Automatic Meter Reading in Unconstrained Scenarios: A Robust and Efficient Approach0
Show:102550
← PrevPage 32 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCRAccuracy (%)89.6Unverified
2DTrOCR 105MAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified