SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 76100 of 309 papers

TitleStatusHype
Message Passing Attention Networks for Document UnderstandingCode0
Data-driven Coreference-based Ontology BuildingCode0
Matching Article Pairs with Graphical Decomposition and ConvolutionsCode0
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?Code0
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document UnderstandingCode0
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document UnderstandingCode0
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode0
DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document ImagesCode0
Class-Agnostic Region-of-Interest Matching in Document ImagesCode0
Marten: Visual Question Answering with Mask Generation for Multi-modal Document UnderstandingCode0
ChuLo: Chunk-Level Key Information Representation for Long Document ProcessingCode0
Chargrid: Towards Understanding 2D DocumentsCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
Learned Compression for Compressed LearningCode0
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingCode0
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding ModelsCode0
Is ChatGPT A Good Keyphrase Generator? A Preliminary StudyCode0
Information Redundancy and Biases in Public Document Information Extraction BenchmarksCode0
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document UnderstandingCode0
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document ParsingCode0
Improving Clinical Document Understanding on COVID-19 Research with Spark NLPCode0
Machine Unlearning for Document ClassificationCode0
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural NetworkCode0
Show:102550
← PrevPage 4 of 13Next →

No leaderboard results yet.