SOTAVerified

Optical Character Recognition

Papers

Showing 1120 of 526 papers

TitleStatusHype
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression RecognitionCode1
TextSR: Diffusion Super-Resolution with Multilingual OCR Guidance0
MT^3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning0
Words as Geometric Features: Estimating Homography using Optical Character Recognition as Compressed Image Representation0
How Do Large Vision-Language Models See Text in Image? Unveiling the Distinctive Role of OCR Heads0
Every Pixel Tells a Story: End-to-End Urdu Newspaper OCR0
Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?Code1
LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?Code1
Low-Resource Language Processing: An OCR-Driven Summarization and Translation PipelineCode0
PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto LanguageCode0
Show:102550
← PrevPage 2 of 53Next →

No leaderboard results yet.