SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 251275 of 309 papers

TitleStatusHype
Learned Compression for Compressed LearningCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingCode0
Knowing Where and What: Unified Word Block Pretraining for Document UnderstandingCode0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language ModelsCode0
Long-Range Transformer Architectures for Document UnderstandingCode0
ChuLo: Chunk-Level Key Information Representation for Long Document ProcessingCode0
DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document ImagesCode0
Skim-Attention: Learning to Focus via Document LayoutCode0
3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document UnderstandingCode0
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode0
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document UnderstandingCode0
Machine Unlearning for Document ClassificationCode0
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document UnderstandingCode0
Marten: Visual Question Answering with Mask Generation for Multi-modal Document UnderstandingCode0
Zero-Shot Complex Question-Answering on Long Scientific DocumentsCode0
Pre-training Meets Clustering: A Hybrid Extractive Multi-document Summarization ModelCode0
Matching Article Pairs with Graphical Decomposition and ConvolutionsCode0
Primer AI's Systems for Acronym Identification and DisambiguationCode0
Is ChatGPT A Good Keyphrase Generator? A Preliminary StudyCode0
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?Code0
Information Redundancy and Biases in Public Document Information Extraction BenchmarksCode0
EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet AllocationCode0
Show:102550
← PrevPage 11 of 13Next →

No leaderboard results yet.