SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 131-140 of 309 papers

Title | Status | Hype
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding | | 0
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding | | 0
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document | Code | 5
Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models | | 0
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding | Code | 1
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding | Code | 0
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | | 0
RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning | | 0
LAPDoc: Layout-Aware Prompting for Documents | | 0
Financial Report Chunking for Effective Retrieval Augmented Generation | Code | 0

No leaderboard results yet.