SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 101150 of 309 papers

TitleStatusHype
DocMIA: Document-Level Membership Inference Attacks against DocVQA ModelsCode0
Message Passing Attention Networks for Document UnderstandingCode0
A Survey of Deep Learning Approaches for OCR and Document UnderstandingCode0
Blockwise Self-Attention for Long Document UnderstandingCode0
Is ChatGPT A Good Keyphrase Generator? A Preliminary StudyCode0
Marten: Visual Question Answering with Mask Generation for Multi-modal Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document UnderstandingCode0
Matching Article Pairs with Graphical Decomposition and ConvolutionsCode0
EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet AllocationCode0
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language ModelsCode0
M-DocSum: Do LVLMs Genuinely Comprehend Interleaved Image-Text in Document Summarization?Code0
Multimodal Adaptive Inference for Document Image Classification with Anytime Early ExitingCode0
M^6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode0
Improving Clinical Document Understanding on COVID-19 Research with Spark NLPCode0
Financial Report Chunking for Effective Retrieval Augmented GenerationCode0
Information Redundancy and Biases in Public Document Information Extraction BenchmarksCode0
Long-Range Transformer Architectures for Document UnderstandingCode0
Hypergraph based Understanding for Document Semantic Entity RecognitionCode0
Machine Unlearning for Document ClassificationCode0
Bidirectional Context-Aware Hierarchical Attention Network for Document UnderstandingCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
Learned Compression for Compressed LearningCode0
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingCode0
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document UnderstandingCode0
Knowing Where and What: Unified Word Block Pretraining for Document UnderstandingCode0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document UnderstandingCode0
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document UnderstandingCode0
Table Detection for Visually Rich Document ImagesCode0
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning0
Génération de question à partir d’analyse sémantique pour l’adaptation non supervisée de modèles de compréhension de documents (Question generation from semantic analysis for unsupervised adaptation of document understanding models)0
BERT-AL: BERT for Arbitrarily Long Document Understanding0
From Entity Linking to Question Answering -- Recent Progress on Semantic Grounding Tasks0
Friendly Topic Assistant for Transformer Based Abstractive Summarization0
Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review0
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction0
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc0
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content0
A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA Publications0
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction0
Finding Pragmatic Differences Between Disciplines0
Decontextualization: Making Sentences Stand-Alone0
Automatic Knowledge Extraction with Human Interface0
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding0
DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights0
Extract with Order for Coherent Multi-Document Summarization0
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling0
DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding0
Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer0
Arctic-TILT. Business Document Understanding at Sub-Billion Scale0
ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.