SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 101–110 of 309 papers

Title | Status | Hype
Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding | Code | 0
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding | Code | 1
NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition | - | 0
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Code | 1
Hypergraph based Understanding for Document Semantic Entity Recognition | Code | 0
A Bounding Box is Worth One Token: Interleaving Layout and Text in a Large Language Model for Document Understanding | Code | 2
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations | Code | 2
ColPali: Efficient Document Retrieval with Vision Language Models | Code | 7
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming | - | 0
DrVideo: Document Retrieval Based Long Video Understanding | - | 0

No leaderboard results yet.