SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 3140 of 309 papers

TitleStatusHype
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata ExtractionCode0
SFDLA: Source-Free Document Layout AnalysisCode0
A Simple yet Effective Layout Token in Large Language Models for Document Understanding0
MDocAgent: A Multi-Modal Multi-Agent Framework for Document UnderstandingCode3
Marten: Visual Question Answering with Mask Generation for Multi-modal Document UnderstandingCode0
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of TricksCode0
A Token-level Text Image Foundation Model for Document Understanding0
Zero-Shot Complex Question-Answering on Long Scientific DocumentsCode0
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language ModelsCode0
Show:102550
← PrevPage 4 of 31Next →

No leaderboard results yet.