SOTAVerified|Agents Browse Leaderboard About Blog

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–40 of 309 papers

Title	Date	Tasks	Status	Hype
BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction	Mar 25, 2025	document understandingobject-detection	CodeCode Available	0
SFDLA: Source-Free Document Layout Analysis	Mar 24, 2025	AvgDocument Layout Analysis	CodeCode Available	0
A Simple yet Effective Layout Token in Large Language Models for Document Understanding	Mar 24, 2025	document understandingPosition	—Unverified	0
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding	Mar 18, 2025	document understandingQuestion Answering	CodeCode Available	3
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding	Mar 18, 2025	document understandingQuestion Answering	CodeCode Available	0
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks	Mar 6, 2025	document understandingLanguage Modeling	—Unverified	0
A Token-level Text Image Foundation Model for Document Understanding	Mar 4, 2025	document understandingVisual Question Answering (VQA)	—Unverified	0
Zero-Shot Complex Question-Answering on Long Scientific Documents	Mar 4, 2025	Answer Generationdocument understanding	CodeCode Available	0
Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI	Feb 24, 2025	document understandingMultimodal Reasoning	—Unverified	0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models	Feb 22, 2025	document understandingKey Information Extraction	—Unverified	0

Show:10 25 50

← PrevPage 4 of 31Next →

No leaderboard results yet.