SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 8190 of 309 papers

TitleStatusHype
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning0
Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning0
The Hidden Structure -- Improving Legal Document Understanding Through Explicit Text Formatting0
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?0
Document Image Rectification Bases on Self-Adaptive Multitask Fusion0
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language ModelsCode0
Relation-Rich Visual Document Generator for Visual Information ExtractionCode0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
QID: Efficient Query-Informed ViTs in Data-Scarce Regimes for OCR-free Visual Document Understanding0
Show:102550
← PrevPage 9 of 31Next →

No leaderboard results yet.