SOTAVerified

Optical Character Recognition

Papers

Showing 126150 of 526 papers

TitleStatusHype
Calibrated Structured PredictionCode0
Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character RecognitionCode0
Answering Questions about Data Visualizations using Efficient Bimodal FusionCode0
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting RecognitionCode0
Aligned Music Notation and Lyrics TranscriptionCode0
LMV-RPA: Large Model Voting-based Robotic Process AutomationCode0
License Plate Detection and Recognition in Unconstrained ScenariosCode0
Brno Mobile OCR DatasetCode0
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document TranscriptionCode0
M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine TranslationCode0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character RecognitionCode0
Binary Document Image Super Resolution for Improved Readability and OCR PerformanceCode0
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and RecognitionCode0
Handwritten Code Recognition for Pen-and-Paper CS EducationCode0
High-Throughput Phenotyping using Computer Vision and Machine LearningCode0
Enhancing Cross-task Transferability of Adversarial Examples with Dispersion ReductionCode0
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug ReportsCode0
MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual FeaturesCode0
From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character RecognitionCode0
An Evaluation of DNN Architectures for Page Segmentation of Historical NewspapersCode0
Gated Recurrent Convolution Neural Network for OCRCode0
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-CorrectionCode0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language ModelsCode0
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text ImagesCode0
Show:102550
← PrevPage 6 of 22Next →

No leaderboard results yet.