SOTAVerified

Document AI

Papers

Showing 1-25 of 40 papers

| Title | Status | Hype |
| --- | --- | --- |
| DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks | Code | 4 |
| Unifying Vision, Text, and Layout for Universal Document Processing | Code | 3 |
| LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Code | 2 |
| Modular Multimodal Machine Learning for Extraction of Theorems and Proofs in Long Scientific Documents (Extended Version) | Code | 1 |
| On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Code | 1 |
| DiT: Self-supervised Pre-training for Document Image Transformer | Code | 1 |
| DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading | Code | 1 |
| ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction | Code | 1 |
| Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout Analysis | Code | 1 |
| Document Understanding Dataset and Evaluation (DUDE) | Code | 1 |
| Document Intelligence Metrics for Visually Rich Document Evaluation | Code | 1 |
| OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation | Code | 1 |
| Context-Aware Chart Element Detection | Code | 1 |
| A Multi-Modal Multilingual Benchmark for Document Image Classification | | 0 |
| BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations | | 0 |
| Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence | | 0 |
| Development of a Legal Document AI-Chatbot | | 0 |
| Document AI: Benchmarks, Models and Applications | | 0 |
| DoPTA: Improving Document Layout Analysis using Patch-Text Alignment | | 0 |
| Enhancing Document AI Data Generation Through Graph-Based Synthetic Layouts | | 0 |
| FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction | | 0 |
| H2OVL-Mississippi Vision Language Models Technical Report | | 0 |
| ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images | | 0 |
| LongFin: A Multimodal Document Understanding Model for Long Financial Domain Documents | | 0 |
| Model Reporting for Certifiable AI: A Proposal from Merging EU Regulation into AI Development | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LayoutLMv3 | Average F1 | 99.21 | | Unverified |