SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 3140 of 309 papers

TitleStatusHype
ARB: A Comprehensive Arabic Multimodal Reasoning BenchmarkCode1
Docopilot: Improving Multimodal Models for Document-Level UnderstandingCode1
DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document UnderstandingCode1
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document UnderstandingCode1
Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal LearningCode1
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural NetworksCode1
Hierarchical Multimodal Pre-training for Visually Rich Webpage UnderstandingCode1
DocFormer: End-to-End Transformer for Document UnderstandingCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout TransformerCode1
Show:102550
← PrevPage 4 of 31Next →

No leaderboard results yet.