SOTAVerified

Optical Character Recognition

Papers

Showing 151200 of 526 papers

TitleStatusHype
An Evaluation of DNN Architectures for Page Segmentation of Historical NewspapersCode0
M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine TranslationCode0
Measuring Intersectional Biases in Historical DocumentsCode0
DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical DocumentsCode0
LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting RecognitionCode0
LMV-RPA: Large Model Voting-based Robotic Process AutomationCode0
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document TranscriptionCode0
It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug ReportsCode0
DeepErase: Weakly Supervised Ink Artifact Removal in Document Text ImagesCode0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis DatasetCode0
Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource ScriptsCode0
iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and RecognitionCode0
License Plate Detection and Recognition in Unconstrained ScenariosCode0
memorAIs: an Optical Character Recognition and Rule-Based Medication Intake Reminder-Generating SolutionCode0
Multi-Page Document Visual Question Answering using Self-Attention Scoring MechanismCode0
DDI-100: Dataset for Text Detection and RecognitionCode0
A Gaussian Process Upsampling Model for Improvements in Optical Character RecognitionCode0
NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term MemoryCode0
Handwritten Code Recognition for Pen-and-Paper CS EducationCode0
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary StudyCode0
High-Throughput Phenotyping using Computer Vision and Machine LearningCode0
GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document UnderstandingCode0
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math TextbooksCode0
An efficient way for segmentation of Bangla characters in printed document using curved scanningCode0
From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-CorrectionCode0
A Tool for Facilitating OCR Postediting in Historical DocumentsCode0
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine TranslationCode0
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAsCode0
From Videos to URLs: A Multi-Browser Guide To Extract User's Behavior with Optical Character RecognitionCode0
Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten DocumentsCode0
Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question AnsweringCode0
Low-Resource Language Processing: An OCR-Driven Summarization and Translation PipelineCode0
A Survey on Multimodal Large Language ModelsCode0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual AdaptersCode0
Gated Recurrent Convolution Neural Network for OCRCode0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character RecognitionCode0
Reading Between the Mud: A Challenging Motorcycle Racer Number DatasetCode0
Computer vision based vehicle tracking as a complementary and scalable approach to RFID tagging0
Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges0
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends0
Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation0
A survey of modern optical character recognition techniques0
Ancient but Digitized: Developing Handwritten Optical Character Recognition for East Syriac Script Through Creating KHAMIS Dataset0
Combining Human and Machine Transcriptions on the Zooniverse Platform0
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials0
A Study of Sindhi Related and Arabic Script Adapted languages Recognition0
Advancing Visual Specification of Code Requirements for Graphs0
Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts0
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR0
Show:102550
← PrevPage 4 of 11Next →

No leaderboard results yet.