SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 1120 of 309 papers

TitleStatusHype
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning0
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document ParsingCode0
LEMONADE: A Large Multilingual Expert-Annotated Abstractive Event Dataset for the Real WorldCode1
MT^3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning0
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning0
Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning0
ARB: A Comprehensive Arabic Multimodal Reasoning BenchmarkCode1
The Hidden Structure -- Improving Legal Document Understanding Through Explicit Text Formatting0
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?0
Document Image Rectification Bases on Self-Adaptive Multitask Fusion0
Show:102550
← PrevPage 2 of 31Next →

No leaderboard results yet.