SOTAVerified

Optical Character Recognition

Papers

Showing 126-150 of 526 papers

Title | Status | Hype
Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition | Code | 0
Answering Questions about Data Visualizations using Efficient Bimodal Fusion | Code | 0
LMV-RPA: Large Model Voting-based Robotic Process Automation | Code | 0
Aligned Music Notation and Lyrics Transcription | Code | 0
M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation | Code | 0
License Plate Detection and Recognition in Unconstrained Scenarios | Code | 0
Brno Mobile OCR Dataset | Code | 0
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition | Code | 0
MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Code | 0
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition | Code | 0
Binary Document Image Super Resolution for Improved Readability and OCR Performance | Code | 0
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports | Code | 0
High-Throughput Phenotyping using Computer Vision and Machine Learning | Code | 0
Enhancing Cross-task Transferability of Adversarial Examples with Dispersion Reduction | Code | 0
Handwritten Code Recognition for Pen-and-Paper CS Education | Code | 0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Code | 0
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Code | 0
Gated Recurrent Convolution Neural Network for OCR | Code | 0
An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers | Code | 0
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction | Code | 0
From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character Recognition | Code | 0
GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding | Code | 0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Code | 0
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs | Code | 0
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text Images | Code | 0

No leaderboard results yet.