SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 76100 of 309 papers

TitleStatusHype
DavarOCR: A Toolbox for OCR and Multi-Modal Document UnderstandingCode0
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?Code0
Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical ReportCode0
Data-driven Coreference-based Ontology BuildingCode0
Marten: Visual Question Answering with Mask Generation for Multi-modal Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document UnderstandingCode0
Matching Article Pairs with Graphical Decomposition and ConvolutionsCode0
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode0
Machine Unlearning for Document ClassificationCode0
Long-Range Transformer Architectures for Document UnderstandingCode0
DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document ImagesCode0
Class-Agnostic Region-of-Interest Matching in Document ImagesCode0
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document UnderstandingCode0
Multimodal Tree Decoder for Table of Contents Extraction in Document ImagesCode0
Do-GOOD: Towards Distribution Shift Evaluation for Pre-Trained Visual Document Understanding ModelsCode0
ChuLo: Chunk-Level Key Information Representation for Long Document ProcessingCode0
Chargrid: Towards Understanding 2D DocumentsCode0
DocXChain: A Powerful Open-Source Toolchain for Document Parsing and BeyondCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
Knowing Where and What: Unified Word Block Pretraining for Document UnderstandingCode0
Learned Compression for Compressed LearningCode0
Information Redundancy and Biases in Public Document Information Extraction BenchmarksCode0
Show:102550
← PrevPage 4 of 13Next →

No leaderboard results yet.