document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 309 papers

Title	Date	Tasks	Status	Score
DocMIA: Document-Level Membership Inference Attacks against DocVQA Models	Feb 6, 2025	document understandingInference Attack	CodeCode Available	5
Message Passing Attention Networks for Document Understanding	Aug 17, 2019	document understandingMulti-Modal Document Classification	CodeCode Available	5
A Survey of Deep Learning Approaches for OCR and Document Understanding	Nov 27, 2020	document understandingOptical Character Recognition (OCR)	CodeCode Available	5
Blockwise Self-Attention for Long Document Understanding	Nov 7, 2019	document understandingLanguage Modeling	CodeCode Available	5
Is ChatGPT A Good Keyphrase Generator? A Preliminary Study	Mar 23, 2023	Diversitydocument understanding	CodeCode Available	5
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding	Mar 18, 2025	document understandingQuestion Answering	CodeCode Available	5
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding	May 1, 2022	document understanding	CodeCode Available	5
Matching Article Pairs with Graphical Decomposition and Convolutions	Feb 21, 2018	Articlesdocument understanding	CodeCode Available	5
EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation	Dec 9, 2020	document understandingInformation Retrieval	CodeCode Available	5
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models	Apr 16, 2025	document understandingLayout Design	CodeCode Available	5
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?	Mar 27, 2025	Document Summarizationdocument understanding	CodeCode Available	5
Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting	May 21, 2024	document-image-classificationDocument Image Classification	CodeCode Available	5
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	May 15, 2023	ArticlesDocument Layout Analysis	CodeCode Available	5
Improving Clinical Document Understanding on COVID-19 Research with Spark NLP	Dec 7, 2020	AnatomyClinical Assertion Status Detection	CodeCode Available	5
Financial Report Chunking for Effective Retrieval Augmented Generation	Feb 5, 2024	Chunkingdocument understanding	CodeCode Available	5
Information Redundancy and Biases in Public Document Information Extraction Benchmarks	Apr 28, 2023	document understandingKey Information Extraction	CodeCode Available	5
Long-Range Transformer Architectures for Document Understanding	Sep 11, 2023	document understandingInformation Retrieval	CodeCode Available	5
Hypergraph based Understanding for Document Semantic Entity Recognition	Jul 9, 2024	document understanding	CodeCode Available	5
Machine Unlearning for Document Classification	Apr 29, 2024	ClassificationDocument Classification	CodeCode Available	5
Bidirectional Context-Aware Hierarchical Attention Network for Document Understanding	Aug 16, 2019	Abstractive Text Summarizationdocument understanding	CodeCode Available	5
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding	Apr 18, 2021	Document Image Classificationdocument understanding	CodeCode Available	5
Learned Compression for Compressed Learning	Dec 12, 2024	Colorizationdocument understanding	CodeCode Available	5
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding	Apr 8, 2024	Document AIdocument understanding	CodeCode Available	5
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding	Oct 8, 2022	document understandingKnowledge Graphs	CodeCode Available	5
Knowing Where and What: Unified Word Block Pretraining for Document Understanding	Jul 28, 2022	Contrastive Learningdocument understanding	CodeCode Available	5
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding	Dec 29, 2020	Document Image ClassificationDocument Layout Analysis	CodeCode Available	5
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding	Oct 16, 2021	document understanding	CodeCode Available	5
Table Detection for Visually Rich Document Images	May 30, 2023	document understandingobject-detection	CodeCode Available	5
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Jun 5, 2025	document understandingEvent Detection	—Unverified	0
Génération de question à partir d’analyse sémantique pour l’adaptation non supervisée de modèles de compréhension de documents (Question generation from semantic analysis for unsupervised adaptation of document understanding models)	Jun 1, 2022	document understandingQuestion Generation	—Unverified	0
BERT-AL: BERT for Arbitrarily Long Document Understanding	Jan 1, 2020	document understandingText Summarization	—Unverified	0
From Entity Linking to Question Answering -- Recent Progress on Semantic Grounding Tasks	Dec 1, 2016	document understandingEntity Linking	—Unverified	0
Friendly Topic Assistant for Transformer Based Abstractive Summarization	Nov 1, 2020	Abstractive Text SummarizationDocument Summarization	—Unverified	0
Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review	Jul 23, 2024	Deep Learningdocument understanding	—Unverified	0
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction	May 4, 2023	Contrastive Learningdocument understanding	—Unverified	0
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc	Aug 17, 2022	document understandingForm	—Unverified	0
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content	May 24, 2023	Document Summarizationdocument understanding	—Unverified	0
A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA Publications	Jun 22, 2019	document understandingNatural Language Understanding	—Unverified	0
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction	Mar 16, 2022	Document AIdocument understanding	—Unverified	0
Finding Pragmatic Differences Between Disciplines	Sep 30, 2023	DiversityDocument Summarization	—Unverified	0
Decontextualization: Making Sentences Stand-Alone	Feb 9, 2021	document understandingQuestion Answering	—Unverified	0
Automatic Knowledge Extraction with Human Interface	Apr 9, 2021	document understanding	—Unverified	0
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding	May 19, 2023	document understanding	—Unverified	0
DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights	Oct 2, 2024	document understandingDomain Adaptation	—Unverified	0
Extract with Order for Coherent Multi-Document Summarization	Jun 12, 2017	Document Summarizationdocument understanding	—Unverified	0
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling	Dec 6, 2024	document understandingHallucination	—Unverified	0
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding	Jul 14, 2022	document understandingOptical Character Recognition (OCR)	—Unverified	0
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer	May 2, 2025	document understandingHallucination	—Unverified	0
Arctic-TILT. Business Document Understanding at Sub-Billion Scale	Aug 8, 2024	document understandingGPU	—Unverified	0
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding	Sep 18, 2022	Common Sense Reasoningdocument understanding	—Unverified	0

Show:10 25 50

← PrevPage 3 of 7Next →

No leaderboard results yet.