Document Understanding

Document understanding covers document classification, layout analysis, information extraction, and document question answering (DocQA).

Papers


Showing 201–250 of 309 papers

Title	Date	Tasks	Status
A Survey and Approach to Chart Classification	Jul 9, 2023	Chart Understanding, Classification	Unverified
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding	Jul 4, 2023	document understanding, Language Modeling	Code Available
DocumentNet: Bridging the Data Gap in Document Pre-Training	Jun 15, 2023	document understanding, Entity Retrieval	Unverified
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models	Jun 5, 2023	document understanding, Question Answering	Code Available
Table Detection for Visually Rich Document Images	May 30, 2023	document understanding, object-detection	Code Available
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding	May 30, 2023	document-image-classification, Document Image Classification	Unverified
Pre-training Meets Clustering: A Hybrid Extractive Multi-document Summarization Model	May 25, 2023	Clustering, Document Summarization	Code Available
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content	May 24, 2023	Document Summarization, document understanding	Unverified
DUBLIN -- Document Understanding By Language-Image Network	May 23, 2023	Document Classification, document understanding	Unverified
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding	May 19, 2023	document understanding	Unverified
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding	May 16, 2023	Decoder, document understanding	Unverified
DLUE: Benchmarking Document Language Understanding	May 16, 2023	Benchmarking, Document Classification	Unverified
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	May 15, 2023	Articles, Document Layout Analysis	Code Available
Two to Five Truths in Non-Negative Matrix Factorization	May 6, 2023	Clustering, document understanding	Unverified
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction	May 4, 2023	Contrastive Learning, document understanding	Unverified
Revisiting Table Detection Datasets for Visually Rich Documents	May 4, 2023	document understanding, object-detection	Unverified
Information Redundancy and Biases in Public Document Information Extraction Benchmarks	Apr 28, 2023	document understanding, Key Information Extraction	Code Available
What Makes a Good Dataset for Symbol Description Reading?	Apr 17, 2023	document understanding, Math	Unverified
PDFVQA: A New Dataset for Real-World VQA on PDF Documents	Apr 13, 2023	document understanding, Key Information Extraction	Unverified
Is ChatGPT A Good Keyphrase Generator? A Preliminary Study	Mar 23, 2023	Diversity, document understanding	Code Available
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding	Dec 19, 2022	Contrastive Learning, document understanding	Code Available
Multimodal Tree Decoder for Table of Contents Extraction in Document Images	Dec 6, 2022	Decoder, document understanding	Code Available
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information	Nov 29, 2022	document understanding, Retrieval	Unverified
VRDU: A Benchmark for Visually-rich Document Understanding	Nov 15, 2022	document understanding	Unverified
QueryForm: A Simple Zero-shot Form Entity Query Framework	Nov 14, 2022	document understanding, Form	Unverified
Unimodal and Multimodal Representation Training for Relation Extraction	Nov 11, 2022	document understanding, Relation	Unverified
Transformer-based Approach for Document Understanding	Oct 16, 2022	Decoder, Document Layout Analysis	Unverified
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding	Oct 8, 2022	document understanding, Knowledge Graphs	Code Available
XDoc: Unified Pre-training for Cross-Format Document Understanding	Oct 6, 2022	document understanding, Semantic entity labeling	Code Available
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding	Sep 18, 2022	Common Sense Reasoning, document understanding	Unverified
One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text	Sep 12, 2022	document understanding, object-detection	Unverified
Improving Keyphrase Extraction with Data Augmentation and Information Filtering	Sep 11, 2022	Data Augmentation, document understanding	Unverified
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc	Aug 17, 2022	document understanding, Form	Unverified
Understanding Long Documents with Different Position-Aware Attentions	Aug 17, 2022	document understanding, Position	Unverified
Knowing Where and What: Unified Word Block Pretraining for Document Understanding	Jul 28, 2022	Contrastive Learning, document understanding	Code Available
Towards Complex Document Understanding By Discrete Reasoning	Jul 25, 2022	document understanding, Question Answering	Unverified
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding	Jul 14, 2022	document understanding, Optical Character Recognition (OCR)	Code Available
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding	Jun 27, 2022	Document Classification, document understanding	Unverified
Test-Time Adaptation for Visual Document Understanding	Jun 15, 2022	document understanding, Domain Adaptation	Unverified
RDU: A Region-based Approach to Form-style Document Understanding	Jun 14, 2022	document understanding, Form	Unverified
Génération de question à partir d’analyse sémantique pour l’adaptation non supervisée de modèles de compréhension de documents (Question generation from semantic analysis for unsupervised adaptation of document understanding models)	Jun 1, 2022	document understanding, Question Generation	Unverified
MATrIX -- Modality-Aware Transformer for Information eXtraction	May 17, 2022	document understanding	Unverified
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding	May 1, 2022	document understanding	Code Available
DuReader_vis: A Chinese Dataset for Open-domain Document Visual Question Answering	May 1, 2022	document understanding, Open-Domain Question Answering	Code Available
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding	May 1, 2022	document understanding, Form	Unverified
Unified Pretraining Framework for Document Understanding	Apr 22, 2022	Document Layout Analysis, document understanding	Unverified
Robust Text Line Detection in Historical Documents: Learning and Evaluation Methods	Mar 23, 2022	document understanding, Line Detection	Unverified
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction	Mar 16, 2022	Document AI, document understanding	Unverified
Hierarchical BERT for Medical Document Understanding	Mar 11, 2022	document understanding, Sentence	Unverified
WebFormer: The Web-page Transformer for Structure Information Extraction	Feb 1, 2022	Deep Attention, document understanding	Unverified

