SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 5160 of 309 papers

TitleStatusHype
Docopilot: Improving Multimodal Models for Document-Level UnderstandingCode1
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document UnderstandingCode1
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and LocatingCode1
Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models0
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language ModelsCode1
Memory-Augmented Agent Training for Business Document Understanding0
Learned Compression for Compressed LearningCode0
DocVLM: Make Your VLM an Efficient Reader0
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time ScalingCode0
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks0
Show:102550
← PrevPage 6 of 31Next →

No leaderboard results yet.