SOTAVerified|Agents Browse Leaderboard About

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 309 papers

Title	Date	Tasks	Status
A Survey and Approach to Chart Classification	Jul 9, 2023	Chart UnderstandingClassification	—Unverified
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding	Jul 4, 2023	document understandingLanguage Modeling	—Unverified
DocumentNet: Bridging the Data Gap in Document Pre-Training	Jun 15, 2023	document understandingEntity Retrieval	—Unverified
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding Models	Jun 5, 2023	document understandingQuestion Answering	CodeCode Available
Table Detection for Visually Rich Document Images	May 30, 2023	document understandingobject-detection	CodeCode Available
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding	May 30, 2023	document-image-classificationDocument Image Classification	—Unverified
Pre-training Meets Clustering: A Hybrid Extractive Multi-document Summarization Model	May 25, 2023	ClusteringDocument Summarization	CodeCode Available
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content	May 24, 2023	Document Summarizationdocument understanding	—Unverified
DUBLIN -- Document Understanding By Language-Image Network	May 23, 2023	Document Classificationdocument understanding	—Unverified
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding	May 19, 2023	document understanding	—Unverified
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding	May 16, 2023	Decoderdocument understanding	—Unverified
DLUE: Benchmarking Document Language Understanding	May 16, 2023	BenchmarkingDocument Classification	—Unverified
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	May 15, 2023	ArticlesDocument Layout Analysis	CodeCode Available
Two to Five Truths in Non-Negative Matrix Factorization	May 6, 2023	Clusteringdocument understanding	—Unverified
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction	May 4, 2023	Contrastive Learningdocument understanding	—Unverified
Revisiting Table Detection Datasets for Visually Rich Documents	May 4, 2023	document understandingobject-detection	—Unverified
Information Redundancy and Biases in Public Document Information Extraction Benchmarks	Apr 28, 2023	document understandingKey Information Extraction	CodeCode Available
What Makes a Good Dataset for Symbol Description Reading?	Apr 17, 2023	document understandingMath	—Unverified
PDFVQA: A New Dataset for Real-World VQA on PDF Documents	Apr 13, 2023	document understandingKey Information Extraction	—Unverified
Is ChatGPT A Good Keyphrase Generator? A Preliminary Study	Mar 23, 2023	Diversitydocument understanding	CodeCode Available
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding	Dec 19, 2022	Contrastive Learningdocument understanding	CodeCode Available
Multimodal Tree Decoder for Table of Contents Extraction in Document Images	Dec 6, 2022	Decoderdocument understanding	CodeCode Available
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information	Nov 29, 2022	document understandingRetrieval	—Unverified
VRDU: A Benchmark for Visually-rich Document Understanding	Nov 15, 2022	document understanding	—Unverified
QueryForm: A Simple Zero-shot Form Entity Query Framework	Nov 14, 2022	document understandingForm	—Unverified

Show:10 25 50

← PrevPage 9 of 13Next →

No leaderboard results yet.