document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 309 papers

Title	Date	Tasks	Status
GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification	Sep 11, 2023	document-image-classificationDocument Image Classification	—Unverified
Transformer-based Approach for Document Understanding	Oct 16, 2022	DecoderDocument Layout Analysis	—Unverified
Two to Five Truths in Non-Negative Matrix Factorization	May 6, 2023	Clusteringdocument understanding	—Unverified
Understanding Long Documents with Different Position-Aware Attentions	Aug 17, 2022	document understandingPosition	—Unverified
UniDoc: Unified Pretraining Framework for Document Understanding	Dec 1, 2021	document understandingSelf-Supervised Learning	—Unverified
Unified Pretraining Framework for Document Understanding	Apr 22, 2022	Document Layout Analysisdocument understanding	—Unverified
Unimodal and Multimodal Representation Training for Relation Extraction	Nov 11, 2022	document understandingRelation	—Unverified
ViRED: Prediction of Visual Relations in Engineering Drawings	Sep 2, 2024	Decoderdocument understanding	—Unverified
WebFormer: The Web-page Transformer for Structure Information Extraction	Feb 1, 2022	Deep Attentiondocument understanding	—Unverified
"What is the value of templates?" Rethinking Document Information Extraction Datasets for LLMs	Oct 20, 2024	document understandingKey Information Extraction	—Unverified
What Makes a Good Dataset for Symbol Description Reading?	Apr 17, 2023	document understandingMath	—Unverified
WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts	Jun 18, 2025	document understandingMultiple-choice	—Unverified
Workshop on Document Intelligence Understanding	Jul 31, 2023	document understandingVisual Question Answering (VQA)	—Unverified
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding	May 1, 2022	document understandingForm	—Unverified
Deep Learning based Visually Rich Document Content Understanding: A Survey	Aug 2, 2024	Deep Learningdocument understanding	—Unverified
Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models	Dec 18, 2024	Document Classificationdocument-image-classification	—Unverified
WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?	May 16, 2025	document understanding	—Unverified
VRDU: A Benchmark for Visually-rich Document Understanding	Nov 15, 2022	document understanding	—Unverified
Acronym Identification and Disambiguation Shared Tasks for Scientific Document Understanding	Dec 22, 2020	document understanding	—Unverified
A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents	Apr 16, 2024	document understandingKey Information Extraction	—Unverified
A Multi-Modal Multilingual Benchmark for Document Image Classification	Oct 25, 2023	ClassificationCross-Lingual Transfer	—Unverified
Arctic-TILT. Business Document Understanding at Sub-Billion Scale	Aug 8, 2024	document understandingGPU	—Unverified
A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA Publications	Jun 22, 2019	document understandingNatural Language Understanding	—Unverified
A Simple yet Effective Layout Token in Large Language Models for Document Understanding	Mar 24, 2025	document understandingPosition	—Unverified
Assessing Generative AI value in a public sector context: evidence from a field experiment	Feb 13, 2025	document understanding	—Unverified
A Survey and Approach to Chart Classification	Jul 9, 2023	Chart UnderstandingClassification	—Unverified
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends	Jul 14, 2025	document understandingOptical Character Recognition	—Unverified
A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions	Jun 5, 2025	Computational Efficiencydocument understanding	—Unverified
AT-BERT: Adversarial Training BERT for Acronym Identification Winning Solution for SDU@AAAI-21	Jan 11, 2021	document understandingUnsupervised Pre-training	—Unverified
A Token-level Text Image Foundation Model for Document Understanding	Mar 4, 2025	document understandingVisual Question Answering (VQA)	—Unverified
Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding	Oct 1, 2020	document understandinggraph construction	—Unverified
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration	Sep 3, 2023	Decoderdocument understanding	—Unverified
A User-Centered Concept Mining System for Query and Document Understanding at Tencent	May 21, 2019	document understandingKnowledge Base Construction	—Unverified
Auto-encodeurs pour la compr\'ehension de documents parl\'es (Auto-encoders for Spoken Document Understanding)	Jul 1, 2016	document understanding	—Unverified
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer	May 2, 2025	document understandingHallucination	—Unverified
Automatic Knowledge Extraction with Human Interface	Apr 9, 2021	document understanding	—Unverified
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content	May 24, 2023	Document Summarizationdocument understanding	—Unverified
BERT-AL: BERT for Arbitrarily Long Document Understanding	Jan 1, 2020	document understandingText Summarization	—Unverified
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks	Dec 5, 2024	Code Generationdocument understanding	—Unverified
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding	Jun 27, 2022	Document Classificationdocument understanding	—Unverified
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations	Jan 6, 2025	Document AIdocument understanding	—Unverified
BROS: A Pre-trained Language Model for Understanding Texts in Document	Jan 1, 2021	DecoderDiversity	—Unverified
BuDDIE: A Business Document Dataset for Multi-task Information Extraction	Apr 5, 2024	Document Classificationdocument understanding	—Unverified
Building and better understanding vision-language models: insights and future directions	Aug 22, 2024	document understanding	—Unverified
Calculating Semantic Similarity between Academic Articles using Topic Event and Ontology	Nov 30, 2017	Articlesdocument understanding	—Unverified
Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence	Mar 27, 2024	Document AIdocument understanding	—Unverified
Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning	Feb 26, 2024	Data Augmentationdocument understanding	—Unverified
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information	Nov 29, 2022	document understandingRetrieval	—Unverified
CREPE: Coordinate-Aware End-to-End Document Parser	May 1, 2024	document understandingOptical Character Recognition (OCR)	—Unverified
DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights	Oct 2, 2024	document understandingDomain Adaptation	—Unverified

Show:10 25 50

← PrevPage 4 of 7Next →

No leaderboard results yet.