SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 5175 of 309 papers

TitleStatusHype
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language ModelsCode1
ARB: A Comprehensive Arabic Multimodal Reasoning BenchmarkCode1
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document UnderstandingCode1
On Web-based Visual Corpus Construction for Visual Document UnderstandingCode1
DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document UnderstandingCode1
DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine ReadingCode1
Typhoon 2: A Family of Open Text and Multimodal Thai Large Language ModelsCode1
LineFormer: Rethinking Line Chart Data Extraction as Instance SegmentationCode1
DocFormerv2: Local Features for Document UnderstandingCode1
A Discrete Variational Recurrent Topic Model without the Reparametrization TrickCode1
Going Full-TILT Boogie on Document Understanding with Text-Image-Layout TransformerCode1
End-to-end Document Recognition and Understanding with DessurtCode1
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural NetworksCode1
Enhancing Visually-Rich Document Understanding via Layout Structure ModelingCode1
Document Understanding Dataset and Evaluation (DUDE)Code1
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document UnderstandingCode1
Hierarchical Multimodal Pre-training for Visually Rich Webpage UnderstandingCode1
DocFormer: End-to-End Transformer for Document UnderstandingCode1
PaLI-X: On Scaling up a Multilingual Vision and Language ModelCode1
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning0
BERT-AL: BERT for Arbitrarily Long Document Understanding0
Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review0
DeeperDive: The Unreasonable Effectiveness of Weak Supervision in Document Understanding A Case Study in Collaboration with UiPath Inc0
AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content0
A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA Publications0
Show:102550
← PrevPage 3 of 13Next →

No leaderboard results yet.