SOTAVerified

Optical Character Recognition

Papers

Showing 501526 of 526 papers

TitleStatusHype
Efficient Video-Based ALPR System Using YOLO and Visual RhythmCode0
Multi-modal Page Stream Segmentation with Convolutional Neural NetworksCode0
ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram ParsingCode0
Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer LearningCode0
MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR TextsCode0
Multi-Page Document Visual Question Answering using Self-Attention Scoring MechanismCode0
MultiQG-TI: Towards Question Generation from Multi-modal SourcesCode0
ASTER: An Attentional Scene Text Recognizer with Flexible RectificationCode0
Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual AdaptersCode0
Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line AnalysisCode0
ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep LearningCode0
NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term MemoryCode0
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine TranslationCode0
Noisy Parallel Data AlignmentCode0
Post-OCR parsing: building simple and robust parser via BIO taggingCode0
Post-OCR Text Correction for Bulgarian Historical DocumentsCode0
DuoSearch: A Novel Search Engine for Bulgarian Historical DocumentsCode0
Character decomposition to resolve class imbalance problem in Hangul OCRCode0
Object detection deep learning networks for Optical Character RecognitionCode0
Chandojnanam: A Sanskrit Meter Identification and Utilization SystemCode0
Binary Document Image Super Resolution for Improved Readability and OCR PerformanceCode0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis DatasetCode0
DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language ArchivesCode0
An Evaluation of DNN Architectures for Page Segmentation of Historical NewspapersCode0
PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto LanguageCode0
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary StudyCode0
Show:102550
← PrevPage 11 of 11Next →

No leaderboard results yet.