document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 309 papers

Title	Date	Tasks	Status	Hype
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis	Jan 1, 2023	ArticlesDocument Layout Analysis	CodeCode Available	1
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding	Dec 19, 2022	Contrastive Learningdocument understanding	CodeCode Available	0
Multimodal Tree Decoder for Table of Contents Extraction in Document Images	Dec 6, 2022	Decoderdocument understanding	CodeCode Available	0
Unifying Vision, Text, and Layout for Universal Document Processing	Dec 5, 2022	Document AIdocument understanding	CodeCode Available	3
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information	Nov 29, 2022	document understandingRetrieval	—Unverified	0
VRDU: A Benchmark for Visually-rich Document Understanding	Nov 15, 2022	document understanding	—Unverified	0
QueryForm: A Simple Zero-shot Form Entity Query Framework	Nov 14, 2022	document understandingForm	—Unverified	0
Unimodal and Multimodal Representation Training for Relation Extraction	Nov 11, 2022	document understandingRelation	—Unverified	0
On Web-based Visual Corpus Construction for Visual Document Understanding	Nov 7, 2022	document understandingOptical Character Recognition (OCR)	CodeCode Available	1
Transformer-based Approach for Document Understanding	Oct 16, 2022	DecoderDocument Layout Analysis	—Unverified	0
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding	Oct 12, 2022	document-image-classificationDocument Image Classification	CodeCode Available	1
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding	Oct 8, 2022	document understandingKnowledge Graphs	CodeCode Available	0
XDoc: Unified Pre-training for Cross-Format Document Understanding	Oct 6, 2022	document understandingSemantic entity labeling	CodeCode Available	0
DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like Documents	Oct 1, 2022	document understandingForm	CodeCode Available	1
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding	Sep 18, 2022	Common Sense Reasoningdocument understanding	—Unverified	0
One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text	Sep 12, 2022	document understandingobject-detection	—Unverified	0
Improving Keyphrase Extraction with Data Augmentation and Information Filtering	Sep 11, 2022	Data Augmentationdocument understanding	—Unverified	0
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural Networks	Aug 23, 2022	Document Layout Analysisdocument understanding	CodeCode Available	1
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc	Aug 17, 2022	document understandingForm	—Unverified	0
Understanding Long Documents with Different Position-Aware Attentions	Aug 17, 2022	document understandingPosition	—Unverified	0
Knowing Where and What: Unified Word Block Pretraining for Document Understanding	Jul 28, 2022	Contrastive Learningdocument understanding	CodeCode Available	0
Towards Complex Document Understanding By Discrete Reasoning	Jul 25, 2022	document understandingQuestion Answering	—Unverified	0
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding	Jul 14, 2022	document understandingOptical Character Recognition (OCR)	—Unverified	0
Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding	Jun 27, 2022	Document Classificationdocument understanding	—Unverified	0
Test-Time Adaptation for Visual Document Understanding	Jun 15, 2022	document understandingDomain Adaptation	—Unverified	0

Show:10 25 50

← PrevPage 9 of 13Next →

No leaderboard results yet.