document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 309 papers

Title	Date	Tasks	Status
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer	May 2, 2025	document understandingHallucination	—Unverified
Automatic Knowledge Extraction with Human Interface	Apr 9, 2021	document understanding	—Unverified
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content	May 24, 2023	Document Summarizationdocument understanding	—Unverified
BERT-AL: BERT for Arbitrarily Long Document Understanding	Jan 1, 2020	document understandingText Summarization	—Unverified
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks	Dec 5, 2024	Code Generationdocument understanding	—Unverified
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding	Jun 27, 2022	Document Classificationdocument understanding	—Unverified
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations	Jan 6, 2025	Document AIdocument understanding	—Unverified
BROS: A Pre-trained Language Model for Understanding Texts in Document	Jan 1, 2021	DecoderDiversity	—Unverified
BuDDIE: A Business Document Dataset for Multi-task Information Extraction	Apr 5, 2024	Document Classificationdocument understanding	—Unverified
Building and better understanding vision-language models: insights and future directions	Aug 22, 2024	document understanding	—Unverified
Calculating Semantic Similarity between Academic Articles using Topic Event and Ontology	Nov 30, 2017	Articlesdocument understanding	—Unverified
Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence	Mar 27, 2024	Document AIdocument understanding	—Unverified
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning	Feb 26, 2024	Data Augmentationdocument understanding	—Unverified
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information	Nov 29, 2022	document understandingRetrieval	—Unverified
CREPE: Coordinate-Aware End-to-End Document Parser	May 1, 2024	document understandingOptical Character Recognition (OCR)	—Unverified
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding	Jul 14, 2022	document understandingOptical Character Recognition (OCR)	—Unverified
DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights	Oct 2, 2024	document understandingDomain Adaptation	—Unverified
Decontextualization: Making Sentences Stand-Alone	Feb 9, 2021	document understandingQuestion Answering	—Unverified
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc	Aug 17, 2022	document understandingForm	—Unverified
Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review	Jul 23, 2024	Deep Learningdocument understanding	—Unverified
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Jun 5, 2025	document understandingEvent Detection	—Unverified
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications	Jun 12, 2024	document-image-classificationDocument Image Classification	—Unverified
DLUE: Benchmarking Document Language Understanding	May 16, 2023	BenchmarkingDocument Classification	—Unverified
Doc2Im: document to image conversion through self-attentive embedding	Nov 8, 2018	Document To Image Conversiondocument understanding	—Unverified
Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning	May 24, 2025	document understandingVisual Reasoning	—Unverified

Show:10 25 50

← PrevPage 9 of 13Next →

No leaderboard results yet.