SOTAVerified

Optical Character Recognition

Papers

Showing 101-150 of 526 papers

Title | Status | Hype
ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing | Code | 0
Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning | Code | 0
Noisy Parallel Data Alignment | Code | 0
MultiQG-TI: Towards Question Generation from Multi-modal Sources | Code | 0
MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts | Code | 0
A model of diffuse Galactic Radio Emission from 10 MHz to 100 GHz | Code | 0
ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning | Code | 0
Are VLMs Really Blind | Code | 0
Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Code | 0
Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding | Code | 0
NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term Memory | Code | 0
MRZ code extraction from visa and passport documents using convolutional neural networks | Code | 0
Mining Spatio-temporal Data on Industrialization from Historical Registries | Code | 0
Alleviating Digitization Errors in Named Entity Recognition for Historical Documents | Code | 0
Character decomposition to resolve class imbalance problem in Hangul OCR | Code | 0
Chandojnanam: A Sanskrit Meter Identification and Utilization System | Code | 0
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models | Code | 0
ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | Code | 0
AON: Towards Arbitrarily-Oriented Text Recognition | Code | 0
Measuring Intersectional Biases in Historical Documents | Code | 0
memorAIs: an Optical Character Recognition and Rule-Based Medication Intake Reminder-Generating Solution | Code | 0
Multi-modal Page Stream Segmentation with Convolutional Neural Networks | Code | 0
Comparative analysis of optical character recognition methods for Sámi texts from the National Library of Norway | Code | 0
Object detection deep learning networks for Optical Character Recognition | Code | 0
PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System | Code | 0
Calibrated Structured Prediction | Code | 0
Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition | Code | 0
Answering Questions about Data Visualizations using Efficient Bimodal Fusion | Code | 0
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition | Code | 0
Aligned Music Notation and Lyrics Transcription | Code | 0
LMV-RPA: Large Model Voting-based Robotic Process Automation | Code | 0
License Plate Detection and Recognition in Unconstrained Scenarios | Code | 0
Brno Mobile OCR Dataset | Code | 0
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Code | 0
M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation | Code | 0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Code | 0
Binary Document Image Super Resolution for Improved Readability and OCR Performance | Code | 0
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition | Code | 0
Handwritten Code Recognition for Pen-and-Paper CS Education | Code | 0
High-Throughput Phenotyping using Computer Vision and Machine Learning | Code | 0
Enhancing Cross-task Transferability of Adversarial Examples with Dispersion Reduction | Code | 0
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports | Code | 0
MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Code | 0
From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition | Code | 0
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers | Code | 0
Gated Recurrent Convolution Neural Network for OCR | Code | 0
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction | Code | 0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Code | 0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models | Code | 0
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images | Code | 0

No leaderboard results yet.