Document Understanding

Document understanding covers document classification, layout analysis, information extraction, and document question answering (DocQA).

Papers

Showing 176–200 of 309 papers

Title	Date	Tasks	Status	Hype
DocumentNet: Bridging the Data Gap in Document Pre-Training	Jun 15, 2023	document understanding, Entity Retrieval	Unverified	0
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents	Jun 9, 2023	Contrastive Learning, document understanding	Code Available	1
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models	Jun 5, 2023	document understanding, Question Answering	Code Available	0
DocFormerv2: Local Features for Document Understanding	Jun 2, 2023	Decoder, document understanding	Code Available	1
Table Detection for Visually Rich Document Images	May 30, 2023	document understanding, object-detection	Code Available	0
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding	May 30, 2023	document-image-classification, Document Image Classification	Unverified	0
PaLI-X: On Scaling up a Multilingual Vision and Language Model	May 29, 2023	Chart Question Answering, document understanding	Code Available	1
Pre-training Meets Clustering: A Hybrid Extractive Multi-document Summarization Model	May 25, 2023	Clustering, Document Summarization	Code Available	0
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content	May 24, 2023	Document Summarization, document understanding	Unverified	0
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models	May 24, 2023	document understanding, Image Captioning	Code Available	1
DUBLIN -- Document Understanding By Language-Image Network	May 23, 2023	Document Classification, document understanding	Unverified	0
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding	May 19, 2023	document understanding	Unverified	0
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding	May 16, 2023	Decoder, document understanding	Unverified	0
DLUE: Benchmarking Document Language Understanding	May 16, 2023	Benchmarking, Document Classification	Unverified	0
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	May 15, 2023	Articles, Document Layout Analysis	Code Available	0
Document Understanding Dataset and Evaluation (DUDE)	May 15, 2023	Document AI, document understanding	Code Available	1
Two to Five Truths in Non-Negative Matrix Factorization	May 6, 2023	Clustering, document understanding	Unverified	0
Revisiting Table Detection Datasets for Visually Rich Documents	May 4, 2023	document understanding, object-detection	Unverified	0
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction	May 4, 2023	Contrastive Learning, document understanding	Unverified	0
LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation	May 3, 2023	Data Visualization, document understanding	Code Available	1
CCpdf: Building a High Quality Corpus for Visually Rich Documents from Web Crawl Data	Apr 28, 2023	document understanding, Language Modeling	Code Available	1
Information Redundancy and Biases in Public Document Information Extraction Benchmarks	Apr 28, 2023	document understanding, Key Information Extraction	Code Available	0
What Makes a Good Dataset for Symbol Description Reading?	Apr 17, 2023	document understanding, Math	Unverified	0
PDFVQA: A New Dataset for Real-World VQA on PDF Documents	Apr 13, 2023	document understanding, Key Information Extraction	Unverified	0
Is ChatGPT A Good Keyphrase Generator? A Preliminary Study	Mar 23, 2023	Diversity, document understanding	Code Available	0
