SOTAVerified

Optical Character Recognition

Papers

Showing 51–75 of 526 papers

| Title | Status | Hype |
| --- | --- | --- |
| Geometry Restoration and Dewarping of Camera-Captured Document Images | Code | 1 |
| SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | | 0 |
| Efficient Video-Based ALPR System Using YOLO and Visual Rhythm | Code | 0 |
| Embedding Similarity Guided License Plate Super Resolution | | 0 |
| CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR | | 0 |
| OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning | Code | 4 |
| Optical Character Recognition using Convolutional Neural Networks for Ashokan Brahmi Inscriptions | | 0 |
| Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Code | 0 |
| ERPA: Efficient RPA Model Integrating OCR and LLMs for Intelligent Document Processing | | 0 |
| Leveraging Deep Learning with Multi-Head Attention for Accurate Extraction of Medicine from Handwritten Prescriptions | | 0 |
| VORTEX: A Spatial Computing Framework for Optimized Drone Telemetry Extraction from First-Person View Flight Data | | 0 |
| LMV-RPA: Large Model Voting-based Robotic Process Automation | Code | 0 |
| Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts | Code | 0 |
| RoundTripOCR: A Data Generation Technique for Enhancing Post-OCR Error Correction in Low-Resource Devanagari Languages | Code | 0 |
| Advancing Vehicle Plate Recognition: Multitasking Visual Language Models with VehiclePaliGemma | | 0 |
| Enhancement of text recognition for hanja handwritten documents of Ancient Korea | | 0 |
| DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding | Code | 9 |
| POINTS1.5: Building a Vision-Language Model towards Real World Applications | | 0 |
| Aligned Music Notation and Lyrics Transcription | Code | 0 |
| Text Change Detection in Multilingual Documents Using Image Comparison | | 0 |
| Patchfinder: Leveraging Visual Language Models for Accurate Information Retrieval using Model Uncertainty | | 0 |
| OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation | Code | 2 |
| AI-assisted summary of suicide risk Formulation | | 0 |
| Towards Accessible Learning: Deep Learning-Based Potential Dysgraphia Detection and OCR for Potentially Dysgraphic Handwriting | | 0 |
| DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives | Code | 0 |
Page 3 of 22

No leaderboard results yet.