SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 176200 of 309 papers

TitleStatusHype
MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding0
MT^3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning0
Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web0
NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
Notes on Applicability of GPT-4 to Document Understanding0
Object-oriented Neural Programming (OONP) for Document Understanding0
One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text0
On Scaling Up a Multilingual Vision and Language Model0
OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis0
PDFVQA: A New Dataset for Real-World VQA on PDF Documents0
Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning0
Position Masking for Improved Layout-Aware Document Understanding0
Probing Position-Aware Attention Mechanism in Long Document Understanding0
ProtoNER: Few shot Incremental Learning for Named Entity Recognition using Prototypical Networks0
PSG: Prompt-based Sequence Generation for Acronym Extraction0
QID: Efficient Query-Informed ViTs in Data-Scarce Regimes for OCR-free Visual Document Understanding0
QueryForm: A Simple Zero-shot Form Entity Query Framework0
RDU: A Region-based Approach to Form-style Document Understanding0
Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API0
ReLayout: Towards Real-World Document Understanding via Layout-enhanced Pre-training0
Retrieval Augmented Structured Generation: Business Document Information Extraction As Tool Use0
Revisiting Table Detection Datasets for Visually Rich Documents0
RJUA-MedDQA: A Multimodal Benchmark for Medical Document Question Answering and Clinical Reasoning0
Robust Text Line Detection in Historical Documents: Learning and Evaluation Methods0
Show:102550
← PrevPage 8 of 13Next →

No leaderboard results yet.