SOTAVerified

document understanding

Document understanding covers document classification, layout analysis, information extraction, and document question answering (DocQA).

Papers

Showing 126–150 of 309 papers

| Title | Status | Hype |
| --- | --- | --- |
| MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding | | 0 |
| Data-driven Coreference-based Ontology Building | Code | 0 |
| "What is the value of templates?" Rethinking Document Information Extraction Datasets for LLMs | | 0 |
| Harnessing Webpage UIs for Text-Rich Visual Understanding | | 0 |
| ChuLo: Chunk-Level Key Information Representation for Long Document Processing | Code | 0 |
| ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training | | 0 |
| DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | | 0 |
| DAViD: Domain Adaptive Visually-Rich Document Understanding with Synthetic Insights | | 0 |
| Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications | | 0 |
| DocMamba: Efficient Document Pre-training with State Space Model | | 0 |
| Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5 | | 0 |
| Information Extraction from Visually Rich Documents Using Directed Weighted Graph Neural Network | Code | 0 |
| mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding | | 0 |
| ViRED: Prediction of Visual Relations in Engineering Drawings | | 0 |
| The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts | | 0 |
| SynthDoc: Bilingual Documents Synthesis for Visual Document Understanding | | 0 |
| Building and better understanding vision-language models: insights and future directions | | 0 |
| Arctic-TILT. Business Document Understanding at Sub-Billion Scale | | 0 |
| Deep Learning based Visually Rich Document Content Understanding: A Survey | | 0 |
| Deep Learning based Key Information Extraction from Business Documents: Systematic Literature Review | | 0 |
| Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding | Code | 0 |
| NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition | | 0 |
| Hypergraph based Understanding for Document Semantic Entity Recognition | Code | 0 |
| DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming | | 0 |
| DrVideo: Document Retrieval Based Long Video Understanding | | 0 |

No leaderboard results yet.