SOTAVerified

Optical Character Recognition

Papers

Showing 51100 of 526 papers

TitleStatusHype
Toxicity of the Commons: Curating Open-Source Pre-Training DataCode1
TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal DomainCode1
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression RecognitionCode1
Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character RecognitionCode1
ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in ImagesCode1
WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech RecognitionCode1
Robust Learning for Text Classification with Multi-source Noise Simulation and Hard Example MiningCode1
Hespi: A pipeline for automatically detecting information from hebarium specimen sheetsCode1
bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali DocumentsCode1
Confidence-aware Non-repetitive Multimodal Transformers for TextCapsCode1
Implicit Feature Alignment: Learn to Convert Text Recognizer to Text SpotterCode1
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text RetrievalCode1
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth EvaluationCode1
GenKIE: Robust Generative Multimodal Document Key Information ExtractionCode1
Iranis: A Large-scale Dataset of Farsi License Plate CharactersCode1
An Empirical Study of Scaling Law for OCRCode1
A Two-Step Approach for Automatic OCR Post-CorrectionCode1
DocParser: End-to-end OCR-free Information Extraction from Visually Rich DocumentsCode1
Efficient OCR for Building a Diverse Digital HistoryCode1
Detection of Furigana Text in ImagesCode1
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and RecognitionCode1
Exploring Better Text Image Translation with Multimodal CodebookCode1
BankNote-Net: Open dataset for assistive universal currency recognitionCode1
FAWA: Fast Adversarial Watermark Attack on Optical Character Recognition (OCR) SystemsCode1
Digitizing Historical Balance Sheet Data: A Practitioner's GuideCode1
Geometry Restoration and Dewarping of Camera-Captured Document ImagesCode1
Data Generation for Post-OCR correction of Cyrillic handwritingCode1
hmBERT: Historical Multilingual Language Models for Named Entity RecognitionCode1
CORU: Comprehensive Post-OCR Parsing and Receipt Understanding DatasetCode1
Fully Unsupervised Diversity Denoising with Convolutional Variational AutoencodersCode1
Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documentsCode1
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?Code1
Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image ClassificationCode1
A Large Multi-Target Dataset of Common Bengali Handwritten GraphemesCode1
Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven ApproachCode1
Boosting on the shoulders of giants in quantum device calibrationCode1
Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text ImagesCode1
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question AnsweringCode1
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends0
A survey of modern optical character recognition techniques0
Ancient but Digitized: Developing Handwritten Optical Character Recognition for East Syriac Script Through Creating KHAMIS Dataset0
A Study of Sindhi Related and Arabic Script Adapted languages Recognition0
Advancing Visual Specification of Code Requirements for Graphs0
Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma0
An Assessment of the Impact of OCR Noise on Language Models0
Abstractive Information Extraction from Scanned Invoices (AIESI) using End-to-end Sequential Approach0
Artificial neural networks and fuzzy logic for recognizing alphabet characters and mathematical symbols0
Artificial Eye for the Blind0
An accurate and revised version of optical character recognition-based speech synthesis using LabVIEW0
Confidence-Aware Document OCR Error Detection0
Show:102550
← PrevPage 2 of 11Next →

No leaderboard results yet.