Document Understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 126–150 of 309 papers

Title	Date	Tasks	Status
Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence	Mar 27, 2024	Document AI, document understanding	Unverified
Document Collection Visual Question Answering	Apr 27, 2021	document understanding, Question Answering	Unverified
A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions	Jun 5, 2025	Computational Efficiency, document understanding	Unverified
Calculating Semantic Similarity between Academic Articles using Topic Event and Ontology	Nov 30, 2017	Articles, document understanding	Unverified
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding	Nov 20, 2023	document understanding, Language Modeling	Unverified
Building and better understanding vision-language models: insights and future directions	Aug 22, 2024	document understanding	Unverified
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends	Jul 14, 2025	document understanding, Optical Character Recognition	Unverified
BuDDIE: A Business Document Dataset for Multi-task Information Extraction	Apr 5, 2024	Document Classification, document understanding	Unverified
DocMamba: Efficient Document Pre-training with State Space Model	Sep 18, 2024	document understanding	Unverified
A Survey and Approach to Chart Classification	Jul 9, 2023	Chart Understanding, Classification	Unverified
DocLLM: A layout-aware generative language model for multimodal document understanding	Dec 31, 2023	document understanding, Language Modeling	Unverified
BROS: A Pre-trained Language Model for Understanding Texts in Document	Jan 1, 2021	Decoder, Diversity	Unverified
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding	May 30, 2023	document-image-classification, Document Image Classification	Unverified
Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding	Nov 12, 2024	document understanding, Optical Character Recognition (OCR)	Unverified
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations	Jan 6, 2025	Document AI, document understanding	Unverified
LAPDoc: Layout-Aware Prompting for Documents	Feb 15, 2024	document understanding, Key Information Extraction	Unverified
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming	Jun 27, 2024	document understanding	Unverified
A Simple yet Effective Layout Token in Large Language Models for Document Understanding	Mar 24, 2025	document understanding, Position	Unverified
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation	Nov 22, 2024	Anomaly Detection, document understanding	Unverified
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models	Oct 4, 2024	document understanding, Knowledge Distillation	Unverified
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding	Apr 16, 2021	document understanding	Unverified
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding	Mar 21, 2024	document-image-classification, Document Image Classification	Unverified
DocGraphLM: Documental Graph Language Model for Information Extraction	Jan 5, 2024	document understanding, Language Modeling	Unverified
Improving Keyphrase Extraction with Data Augmentation and Information Filtering	Sep 11, 2022	Data Augmentation, document understanding	Unverified
Joint Structured Learning and Predictions under Logical Constraints in Conditional Random Fields	Aug 25, 2017	BIG-bench Machine Learning, document understanding	Unverified

No leaderboard results yet.