SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 241250 of 309 papers

TitleStatusHype
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding0
MT^3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning0
Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web0
NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition0
NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding0
Notes on Applicability of GPT-4 to Document Understanding0
Object-oriented Neural Programming (OONP) for Document Understanding0
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition0
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models0
Show:102550
← PrevPage 25 of 31Next →

No leaderboard results yet.