SOTAVerified

Optical Character Recognition (OCR)

Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)

Papers

Showing 801850 of 1209 papers

TitleStatusHype
Discovering Airline-Specific Business Intelligence from Online Passenger Reviews: An Unsupervised Text Analytics Approach0
Vartani Spellcheck -- Automatic Context-Sensitive Spelling Correction of OCR-generated Hindi Text Using BERT and Levenshtein Distance0
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps0
BennettNLP at SemEval-2020 Task 8: Multimodal sentiment classification Using Hybrid Hierarchical Classifier0
Detecting de minimis Code-Switching in Historical German Books0
SIS@IIITH at SemEval-2020 Task 8: An Overview of Simple Text Classification Methods for Meme Analysis0
Ad Lingua: Text Classification Improves Symbolism Prediction in Image Advertisements0
Building a Part-of-Speech Tagged Corpus for Drenjongke (Bhutia)Code0
CSECU\_KDE\_MA at SemEval-2020 Task 8: A Neural Attention Model for Memotion Analysis0
A Survey of Deep Learning Approaches for OCR and Document UnderstandingCode0
A Panoramic Survey of Natural Language Processing in the Arab World0
SuperOCR: A Conversion from Optical Character Recognition to Image Captioning0
On-Device Text Image Super Resolution0
Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts0
On-Device Language Identification of Text in Images using Diacritic Characters0
Automated data extraction of bar chart raster images0
Handwriting Classification for the Analysis of Art-Historical DocumentsCode0
Automated Transcription of Non-Latin Script Periodicals: A Case Study in the Ottoman Turkish Print Archive0
OCR, Classification & Machine Translation (OCCAM)0
Chunk-based Chinese Spelling Check with Global Optimization0
Alleviating Digitization Errors in Named Entity Recognition for Historical DocumentsCode0
Persian Handwritten Digit, Character and Word Recognition Using Deep Learning0
Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution0
A Conglomerate of Multiple OCR Table Detection and Extraction0
DocStruct: A Multimodal Method to Extract Hierarchy Structure in Document for General Form Understanding0
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering0
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition0
Towards Image-based Automatic Meter Reading in Unconstrained Scenarios: A Robust and Efficient Approach0
An Efficient Language-Independent Multi-Font OCR for Arabic Script0
Word Segmentation from Unconstrained Handwritten Bangla Document Images using Distance Transform0
Handwritten Script Identification from Text Lines0
A New Approach for Texture based Script Identification At Block Level using Quad Tree Decomposition0
Fast Implementation of 4-bit Convolutional Neural Networks for Mobile Devices0
Abstractive Information Extraction from Scanned Invoices (AIESI) using End-to-end Sequential Approach0
MRZ code extraction from visa and passport documents using convolutional neural networksCode0
OCR Graph Features for Manipulation Detection in Documents0
Optical Character Recognition, Word Segmentation, Sentence Segmentation, and Information Extraction for Historical and Literature Texts in Classical Chinese0
EASTER: Efficient and Scalable Text Recognizer0
On the Accuracy of CRNNs for Line-Based OCR: A Multi-Parameter Evaluation0
Can You Read Me Now? Content Aware Rectification using Angle Supervision0
Weakly Supervised Construction of ASR Systems with Massive Video Data0
An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension0
Advancing Visual Specification of Code Requirements for Graphs0
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection0
Tamil Vowel Recognition With Augmented MNIST-like Data Set0
What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images0
Exploiter des mod\`eles de langue pour \'evaluer des sorties de logiciels d'OCR pour des documents fran du XVIIe si\`ecle ()0
Computer Vision Toolkit for Non-invasive Monitoring of Factory Floor Artifacts0
Quantitative Analysis of Image Classification Techniques for Memory-Constrained Devices0
Deep Learning Based Vehicle Tracking System Using License Plate Detection And Recognition0
Show:102550
← PrevPage 17 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy (%)89.6Unverified
2DTrOCRAccuracy (%)89.6Unverified
3MaskOCR-LAccuracy (%)82.6Unverified
4TransOCRAccuracy (%)72.8Unverified
5SRNAccuracy (%)65Unverified
6MORANAccuracy (%)64.3Unverified
7SEEDAccuracy (%)61.2Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4oAverage Accuracy76.22Unverified
2Gemini-1.5 ProAverage Accuracy76.13Unverified
3Claude-3 SonnetAverage Accuracy67.71Unverified
4RapidOCRAverage Accuracy56.98Unverified
5EasyOCRAverage Accuracy49.3Unverified
#ModelMetricClaimedVerifiedStatus
1STREETSequence error27.54Unverified
2SEESequence error22Unverified
3AttentionOCR_Inception-resnet-v2_LocationSequence error15.8Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-NOPOOLBLEU89.09Unverified
2I2L-STRIPSBLEU89Unverified
#ModelMetricClaimedVerifiedStatus
1TesseractCharacter Error Rate (CER)0.08Unverified
2EasyOCRCharacter Error Rate (CER)0.07Unverified
#ModelMetricClaimedVerifiedStatus
1I2L-STRIPSBLEU88.86Unverified