SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 101125 of 309 papers

TitleStatusHype
ChuLo: Chunk-Level Key Information Representation for Long Document ProcessingCode0
Chargrid: Towards Understanding 2D DocumentsCode0
DocXChain: A Powerful Open-Source Toolchain for Document Parsing and BeyondCode0
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document UnderstandingCode0
Marten: Visual Question Answering with Mask Generation for Multi-modal Document UnderstandingCode0
ERNIE-Layout: Layout-Knowledge Enhanced Multi-modal Pre-training for Document UnderstandingCode0
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode0
Machine Unlearning for Document ClassificationCode0
EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet AllocationCode0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language ModelsCode0
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document UnderstandingCode0
Matching Article Pairs with Graphical Decomposition and ConvolutionsCode0
DavarOCR: A Toolbox for OCR and Multi-Modal Document UnderstandingCode0
Learned Compression for Compressed LearningCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
Long-Range Transformer Architectures for Document UnderstandingCode0
DocMIA: Document-Level Membership Inference Attacks against DocVQA ModelsCode0
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of TricksCode0
Is ChatGPT A Good Keyphrase Generator? A Preliminary StudyCode0
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document UnderstandingCode0
A Survey of Deep Learning Approaches for OCR and Document UnderstandingCode0
Knowing Where and What: Unified Word Block Pretraining for Document UnderstandingCode0
Blockwise Self-Attention for Long Document UnderstandingCode0
Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural NetworkCode0
Show:102550
← PrevPage 5 of 13Next →

No leaderboard results yet.