document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–275 of 309 papers

Title	Date	Tasks	Status
Learned Compression for Compressed Learning	Dec 12, 2024	Colorizationdocument understanding	CodeCode Available
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding	Apr 18, 2021	Document Image Classificationdocument understanding	CodeCode Available
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding	Dec 29, 2020	Document Image ClassificationDocument Layout Analysis	CodeCode Available
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding	Apr 8, 2024	Document AIdocument understanding	CodeCode Available
Knowing Where and What: Unified Word Block Pretraining for Document Understanding	Jul 28, 2022	Contrastive Learningdocument understanding	CodeCode Available
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models	Apr 16, 2025	document understandingLayout Design	CodeCode Available
Long-Range Transformer Architectures for Document Understanding	Sep 11, 2023	document understandingInformation Retrieval	CodeCode Available
ChuLo: Chunk-Level Key Information Representation for Long Document Processing	Oct 14, 2024	ChunkingClassification	CodeCode Available
DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document Images	Jun 26, 2025	document understandingOptical Character Recognition (OCR)	CodeCode Available
Skim-Attention: Learning to Focus via Document Layout	Sep 2, 2021	document understandingLanguage Modeling	CodeCode Available
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding	Feb 28, 2024	document understandingForm	CodeCode Available
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	May 15, 2023	ArticlesDocument Layout Analysis	CodeCode Available
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding	Oct 8, 2022	document understandingKnowledge Graphs	CodeCode Available
Machine Unlearning for Document Classification	Apr 29, 2024	ClassificationDocument Classification	CodeCode Available
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding	Oct 16, 2021	document understanding	CodeCode Available
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding	May 1, 2022	document understanding	CodeCode Available
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding	Mar 18, 2025	document understandingQuestion Answering	CodeCode Available
Zero-Shot Complex Question-Answering on Long Scientific Documents	Mar 4, 2025	Answer Generationdocument understanding	CodeCode Available
Pre-training Meets Clustering: A Hybrid Extractive Multi-document Summarization Model	May 25, 2023	ClusteringDocument Summarization	CodeCode Available
Matching Article Pairs with Graphical Decomposition and Convolutions	Feb 21, 2018	Articlesdocument understanding	CodeCode Available
Primer AI's Systems for Acronym Identification and Disambiguation	Dec 14, 2020	document understandingSentence	CodeCode Available
Is ChatGPT A Good Keyphrase Generator? A Preliminary Study	Mar 23, 2023	Diversitydocument understanding	CodeCode Available
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?	Mar 27, 2025	Document Summarizationdocument understanding	CodeCode Available
Information Redundancy and Biases in Public Document Information Extraction Benchmarks	Apr 28, 2023	document understandingKey Information Extraction	CodeCode Available
EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation	Dec 9, 2020	document understandingInformation Retrieval	CodeCode Available

Show:10 25 50

← PrevPage 11 of 13Next →

No leaderboard results yet.