SOTAVerified

Document Understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 201-225 of 309 papers

| Title | Status | Hype |
| --- | --- | --- |
| Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer | | 0 |
| Automatic Knowledge Extraction with Human Interface | | 0 |
| AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content | | 0 |
| BERT-AL: BERT for Arbitrarily Long Document Understanding | | 0 |
| BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks | | 0 |
| Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding | | 0 |
| BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations | | 0 |
| BROS: A Pre-trained Language Model for Understanding Texts in Document | | 0 |
| BuDDIE: A Business Document Dataset for Multi-task Information Extraction | | 0 |
| Building and better understanding vision-language models: insights and future directions | | 0 |
| Calculating Semantic Similarity between Academic Articles using Topic Event and Ontology | | 0 |
| Can AI Models Appreciate Document Aesthetics? An Exploration of Legibility and Layout Quality in Relation to Prediction Confidence | | 0 |
| Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | | 0 |
| ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information | | 0 |
| CREPE: Coordinate-Aware End-to-End Document Parser | | 0 |
| DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding | | 0 |
| DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights | | 0 |
| Decontextualization: Making Sentences Stand-Alone | | 0 |
| DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc | | 0 |
| Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review | | 0 |
| DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning | | 0 |
| DistilDoc: Knowledge Distillation for Visually-Rich Document Applications | | 0 |
| DLUE: Benchmarking Document Language Understanding | | 0 |
| Doc2Im: document to image conversion through self-attentive embedding | | 0 |
| Doc-CoB: Enhancing Multi-Modal Document Understanding with Visual Chain-of-Boxes Reasoning | | 0 |

No leaderboard results yet.