SOTAVerified

document understanding

Document understanding covers document classification, layout analysis, information extraction, and document question answering (DocQA).

Papers

Showing 41–50 of 309 papers

Title | Status | Hype
CAMEL-Bench: A Comprehensive Arabic LMM Benchmark | Code | 1
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding | Code | 1
DocFormer: End-to-End Transformer for Document Understanding | Code | 1
Docopilot: Improving Multimodal Models for Document-Level Understanding | Code | 1
DocFormerv2: Local Features for Document Understanding | Code | 1
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents | Code | 1
LEMONADE: A Large Multilingual Expert-Annotated Abstractive Event Dataset for the Real World | Code | 1
FRAG: Frame Selection Augmented Generation for Long Video and Long Document Understanding | Code | 1
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer | Code | 1
Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding | Code | 1

No leaderboard results yet.