SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 5175 of 309 papers

TitleStatusHype
Ocean-OCR: Towards General OCR Application via a Vision-Language ModelCode1
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document UnderstandingCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
Towards Robust Visual Information Extraction in Real World: New Dataset and Novel SolutionCode1
End-to-end Document Recognition and Understanding with DessurtCode1
Modeling Layout Reading Order as Ordering Relations for Visually-rich Document UnderstandingCode1
DocumentCLIP: Linking Figures and Main Body Text in Reflowed DocumentsCode1
WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl DataCode1
DocQueryNet: Value Retrieval with Arbitrary Queries for Form-like DocumentsCode1
LineFormer: Rethinking Line Chart Data Extraction as Instance SegmentationCode1
Document Understanding Dataset and Evaluation (DUDE)Code1
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode1
Doc2Graph: a Task Agnostic Document Understanding Framework based on Graph Neural NetworksCode1
Docopilot: Improving Multimodal Models for Document-Level UnderstandingCode1
MedICaT: A Dataset of Medical Images, Captions, and Textual ReferencesCode1
On the Affinity, Rationality, and Diversity of Hierarchical Topic ModelingCode1
A Discrete Variational Recurrent Topic Model without the Reparametrization TrickCode1
DocFormer: End-to-End Transformer for Document UnderstandingCode1
DocFormerv2: Local Features for Document UnderstandingCode1
Learned Compression for Compressed LearningCode0
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document UnderstandingCode0
Deeper Clinical Document Understanding Using Relation ExtractionCode0
Knowing Where and What: Unified Word Block Pretraining for Document UnderstandingCode0
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document UnderstandingCode0
Is ChatGPT A Good Keyphrase Generator? A Preliminary StudyCode0
Show:102550
← PrevPage 3 of 13Next →

No leaderboard results yet.