SOTAVerified

Optical Character Recognition

Papers

Showing 151175 of 526 papers

TitleStatusHype
Towards Accessible Learning: Deep Learning-Based Potential Dysgraphia Detection and OCR for Potentially Dysgraphic Handwriting0
DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language ArchivesCode0
M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding0
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models0
Handwriting Recognition in Historical Documents with Multimodal LLM0
Are VLMs Really BlindCode0
Comparison of Image Preprocessing Techniques for Vehicle License Plate Recognition Using OCR: Performance and Accuracy Evaluation0
ChartKG: A Knowledge-Graph-Based Representation for Chart Images0
MIRAGE: Multimodal Identification and Recognition of Annotations in Indian General Prescriptions0
JaPOC: Japanese Post-OCR Correction Benchmark using Vouchers0
See then Tell: Enhancing Key Information Extraction with Vision Grounding0
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials0
MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual FeaturesCode0
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology0
Computer Vision Intelligence Test Modeling and Generation: A Case Study on Smart OCR0
ICDAR 2024 Competition on Few-Shot and Many-Shot Layout Segmentation of Ancient Manuscripts (SAM)0
PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction0
POINTS: Improving Your Vision-language Model with Affordable Strategies0
Confidence-Aware Document OCR Error Detection0
Post-OCR Text Correction for Bulgarian Historical DocumentsCode0
CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language ModelsCode0
Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail0
Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
A Permuted Autoregressive Approach to Word-Level Recognition for Urdu Digital Text0
Show:102550
← PrevPage 7 of 22Next →

No leaderboard results yet.