SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 110 of 309 papers

TitleStatusHype
Qwen2.5-VL Technical ReportCode11
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive PerceptionCode9
ColPali: Efficient Document Retrieval with Vision Language ModelsCode7
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement LearningCode7
Focus Anywhere for Fine-grained Multi-page Document UnderstandingCode5
TextMonkey: An OCR-Free Large Multimodal Model for Understanding DocumentCode5
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image PyramidCode5
LLMMapReduce: Simplified Long-Sequence Processing using Large Language ModelsCode4
OCR-free Document Understanding TransformerCode3
INTERS: Unlocking the Power of Large Language Models in Search with Instruction TuningCode3
Show:102550
← PrevPage 1 of 31Next →

No leaderboard results yet.