SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 126150 of 309 papers

TitleStatusHype
Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence0
Document Collection Visual Question Answering0
A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions0
Calculating Semantic Similarity between Academic Articles using Topic Event and Ontology0
DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding0
Building and better understanding vision-language models: insights and future directions0
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends0
BuDDIE: A Business Document Dataset for Multi-task Information Extraction0
DocMamba: Efficient Document Pre-training with State Space Model0
A Survey and Approach to Chart Classification0
DocLLM: A layout-aware generative language model for multimodal document understanding0
BROS: A Pre-trained Language Model for Understanding Texts in Document0
LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding0
Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding0
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations0
LAPDoc: Layout-Aware Prompting for Documents0
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming0
A Simple yet Effective Layout Token in Large Language Models for Document Understanding0
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation0
DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models0
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding0
LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding0
DocGraphLM: Documental Graph Language Model for Information Extraction0
Improving Keyphrase Extraction with Data Augmentation and Information Filtering0
Joint Structured Learning and Predictions under Logical Constraints in Conditional Random Fields0
Show:102550
← PrevPage 6 of 13Next →

No leaderboard results yet.