SOTAVerified

Document AI

Papers

Showing 125 of 40 papers

TitleStatusHype
DocRes: A Generalist Model Toward Unifying Document Image Restoration TasksCode4
Unifying Vision, Text, and Layout for Universal Document ProcessingCode3
LayoutLM: Pre-training of Text and Layout for Document Image UnderstandingCode2
OfficeBench: Benchmarking Language Agents across Multiple Applications for Office AutomationCode1
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and ReasoningCode1
DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine ReadingCode1
Document AI: A Comparative Study of Transformer-Based, Graph-Based Models, and Convolutional Neural Networks For Document Layout AnalysisCode1
Modular Multimodal Machine Learning for Extraction of Theorems and Proofs in Long Scientific Documents (Extended Version)Code1
Document Understanding Dataset and Evaluation (DUDE)Code1
Context-Aware Chart Element DetectionCode1
ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information ExtractionCode1
Document Intelligence Metrics for Visually Rich Document EvaluationCode1
DiT: Self-supervised Pre-training for Document Image TransformerCode1
Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document ParsingCode0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations0
DoPTA: Improving Document Layout Analysis using Patch-Text Alignment0
Enhancing Document AI Data Generation Through Graph-Based Synthetic Layouts0
H2OVL-Mississippi Vision Language Models Technical Report0
Out-of-Distribution Detection with Attention Head Masking for Multimodal Document ClassificationCode0
Design of a Quality Management System based on the EU Artificial Intelligence ActCode0
XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form ParserCode0
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingCode0
Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence0
Towards Human-Like Machine Comprehension: Few-Shot Relational Learning in Visually-Rich Documents0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LayoutLMv3Average F199.21Unverified