SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten, or printed text into machine-encoded text. The source may be a scanned document, a photo of a document, a scene photo (for example, the text on signs and billboards in a landscape photo, or license plates on cars), or subtitle text superimposed on an image (for example, from a television broadcast).

Papers

Showing 651-700 of 1209 papers

| Title | Status | Hype |
| --- | --- | --- |
| Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching | | 0 |
| Sequence-to-Label Script Identification for Multilingual OCR | | 0 |
| Sequence to Sequence Learning for Optical Character Recognition | | 0 |
| Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding | | 0 |
| Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI | | 0 |
| Similar Document Template Matching Algorithm | | 0 |
| Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps | | 0 |
| Simple Transparent Adversarial Examples | | 0 |
| Simulation d’erreurs d’OCR dans les systèmes de TAL pour le traitement de données anachroniques (Simulation of OCR errors in NLP systems for processing anachronistic data) | | 0 |
| Sinica-IASL Chinese spelling check system at Sighan-7 | | 0 |
| SIS@IIITH at SemEval-2020 Task 8: An Overview of Simple Text Classification Methods for Meme Analysis | | 0 |
| Slide2Text: Leveraging LLMs for Personalized Textbook Generation from PowerPoint Presentations | | 0 |
| Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023 | | 0 |
| Solving Substitution Ciphers with Combined Language Models | | 0 |
| Southern Newswire Corpus: A Large-Scale Dataset of Mid-Century Wire Articles Beyond the Front Page | | 0 |
| SPARLING: Learning Latent Representations with Extremely Sparse Activations | | 0 |
| Sparse Concept Coded Tetrolet Transform for Unconstrained Odia Character Recognition | | 0 |
| SpellBERT: A Lightweight Pretrained Model for Chinese Spelling Check | | 0 |
| Squibs: Spelling Error Patterns in Brazilian Portuguese | | 0 |
| Star-net: A spatial attention residue network for scene text recognition. | | 0 |
| Statistical Learning for OCR Text Correction | | 0 |
| Machine Learning Construction: implications to cybersecurity | | 0 |
| Statistical Machine Translation Improvement based on Phrase Selection | | 0 |
| Still not there? Comparing Traditional Sequence-to-Sequence Models to Encoder-Decoder Neural Networks on Monotone String Translation Tasks | | 0 |
| STRIDE : Scene Text Recognition In-Device | | 0 |
| Structured Analysis and Comparison of Alphabets in Historical Handwritten Ciphers | | 0 |
| Sum-Product Networks for Sequence Labeling | | 0 |
| SuperOCR: A Conversion from Optical Character Recognition to Image Captioning | | 0 |
| SuperOCR for ALTA 2017 Shared Task | | 0 |
| Survey of Computational Approaches to Lexical Semantic Change | | 0 |
| SVDocNet: Spatially Variant U-Net for Blind Document Deblurring | | 0 |
| SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition | | 0 |
| SymbioticRAG: Enhancing Document Intelligence Through Human-LLM Symbiotic Collaboration | | 0 |
| Synergy of Nederlab and | | 0 |
| Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition | | 0 |
| Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks | | 0 |
| Tablext: A Combined Neural Network And Heuristic Based Table Extractor | | 0 |
| Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences | | 0 |
| Tagging Named Entities in 19th Century and Modern Finnish Newspaper Material with a Finnish Semantic Tagger | | 0 |
| Tamil Vowel Recognition With Augmented MNIST-like Data Set | | 0 |
| TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models | | 0 |
| TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content | | 0 |
| TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement | | 0 |
| TECHLIMED@QALB-Shared Task 2015: a hybrid Arabic Error Correction System | | 0 |
| TECHLIMED system description for the Shared Task on Automatic Arabic Error Correction | | 0 |
| TeLCoS: OnDevice Text Localization with Clustering of Script | | 0 |
| Telugu OCR Framework using Deep Learning | | 0 |
| Text-Aware Dual Routing Network for Visual Question Answering | | 0 |
| TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model | | 0 |
| TextCaps: a Dataset for Image Captioning with Reading Comprehension | | 0 |

Benchmark Results

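| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DTrOCR | Accuracy (%) | 89.6 | | Unverified |
| 2 | DTrOCR 105M | Accuracy (%) | 89.6 | | Unverified |
| 3 | MaskOCR-L | Accuracy (%) | 82.6 | | Unverified |
| 4 | TransOCR | Accuracy (%) | 72.8 | | Unverified |
| 5 | SRN | Accuracy (%) | 65 | | Unverified |
| 6 | MORAN | Accuracy (%) | 64.3 | | Unverified |
| 7 | SEED | Accuracy (%) | 61.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4o | Average Accuracy | 76.22 | | Unverified |
| 2 | Gemini-1.5 Pro | Average Accuracy | 76.13 | | Unverified |
| 3 | Claude-3 Sonnet | Average Accuracy | 67.71 | | Unverified |
| 4 | RapidOCR | Average Accuracy | 56.98 | | Unverified |
| 5 | EasyOCR | Average Accuracy | 49.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | STREET | Sequence error | 27.54 | | Unverified |
| 2 | SEE | Sequence error | 22 | | Unverified |
| 3 | AttentionOCR_Inception-resnet-v2_Location | Sequence error | 15.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | I2L-NOPOOL | BLEU | 89.09 | | Unverified |
| 2 | I2L-STRIPS | BLEU | 89 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Tesseract | Character Error Rate (CER) | 0.08 | | Unverified |
| 2 | EasyOCR | Character Error Rate (CER) | 0.07 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | I2L-STRIPS | BLEU | 88.86 | | Unverified |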
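
One of the benchmarks above reports Character Error Rate (CER). As a rough illustration of how that metric is commonly computed, the sketch below divides the character-level edit (Levenshtein) distance between an OCR output and its reference transcription by the reference length. The function names `levenshtein` and `character_error_rate` are hypothetical and chosen for this example. The listing does not state how each benchmark normalizes text or scales its scores, so treat this as one common definition, not as the procedure behind the numbers shown.

```python
def levenshtein(reference: str, hypothesis: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `hypothesis` into `reference`."""
    # prev[j] holds the distance between the first i-1 reference
    # characters and the first j hypothesis characters.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution_cost = 0 if ref_char == hyp_char else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """CER = edit distance / number of characters in the reference.
    Assumption: no case folding or whitespace normalization is applied."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return levenshtein(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Two substituted characters out of eleven reference characters.
    print(character_error_rate("hello world", "hallo wor1d"))
```

Under this definition a lower value is better, and the result can exceed 1.0 when the OCR output contains many spurious characters.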