SOTAVerified

Optical Character Recognition

Papers

Showing 151175 of 526 papers

TitleStatusHype
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug ReportsCode0
An Evaluation of DNN Architectures for Page Segmentation of Historical NewspapersCode0
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document TranscriptionCode0
DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical DocumentsCode0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character RecognitionCode0
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and RecognitionCode0
Handwritten Code Recognition for Pen-and-Paper CS EducationCode0
GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document UnderstandingCode0
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text ImagesCode0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis DatasetCode0
From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character RecognitionCode0
Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource ScriptsCode0
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-CorrectionCode0
Gated Recurrent Convolution Neural Network for OCRCode0
High-Throughput Phenotyping using Computer Vision and Machine LearningCode0
DDI-100: Dataset for Text Detection and RecognitionCode0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
A Gaussian Process Upsampling Model for Improvements in Optical Character RecognitionCode0
Mining Spatio-temporal Data on Industrialization from Historical RegistriesCode0
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary StudyCode0
End-to-End Optical Character Recognition for Bengali Handwritten WordsCode0
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math TextbooksCode0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language ModelsCode0
DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language ArchivesCode0
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAsCode0
Show:102550
← PrevPage 7 of 22Next →

No leaderboard results yet.