SOTAVerified

TextVQA

Papers

Showing 2130 of 47 papers

TitleStatusHype
TextMonkey: An OCR-Free Large Multimodal Model for Understanding DocumentCode5
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language ModelsCode3
VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction OptimizationCode0
Towards a Unified Multimodal Reasoning FrameworkCode0
Multiple-Question Multiple-Answer Text-VQA0
CogVLM: Visual Expert for Pretrained Language ModelsCode5
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA0
Sentence Attention Blocks for Answer Grounding0
Separate and Locate: Rethink the Text in Text-based Visual Question AnsweringCode0
Making the V in Text-VQA Matter0
Show:102550
← PrevPage 3 of 5Next →

No leaderboard results yet.