SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 7180 of 309 papers

TitleStatusHype
PaddleOCR 3.0 Technical Report0
DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document ImagesCode0
Class-Agnostic Region-of-Interest Matching in Document ImagesCode0
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models0
PP-DocBee2: Improved Baselines with Efficient Data for Multimodal Document Understanding0
WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts0
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning0
A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions0
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document ParsingCode0
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning0
Show:102550
← PrevPage 8 of 31Next →

No leaderboard results yet.