SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 111120 of 309 papers

TitleStatusHype
Memory-Augmented Agent Training for Business Document Understanding0
Learned Compression for Compressed LearningCode0
DocVLM: Make Your VLM an Efficient Reader0
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling0
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks0
MATATA: Weakly Supervised End-to-End MAthematical Tool-Augmented Reasoning for Tabular Applications0
DOGE: Towards Versatile Visual Document Grounding and Referring0
StructFormer: Document Structure-based Masked Attention and its Impact on Language Model Pre-Training0
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation0
Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding0
Show:102550
← PrevPage 12 of 31Next →

No leaderboard results yet.