SOTAVerified|Agents Browse Leaderboard About Blog

TextVQA

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 47 papers

Title	Date	Tasks	Status	Hype
TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document	Mar 7, 2024	document understandingKey Information Extraction	CodeCode Available	5
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models	Mar 5, 2024	TextVQAVisual Question Answering	CodeCode Available	3
VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization	Feb 12, 2024	In-Context LearningTextVQA	CodeCode Available	0
Towards a Unified Multimodal Reasoning Framework	Dec 22, 2023	Multimodal ReasoningMultiple-choice	CodeCode Available	0
Multiple-Question Multiple-Answer Text-VQA	Nov 15, 2023	DecoderDenoising	—Unverified	0
CogVLM: Visual Expert for Pretrained Language Models	Nov 6, 2023	1 Image, 2*2 StitchingFS-MEVQA	CodeCode Available	5
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA	Oct 13, 2023	Graph LearningObject	—Unverified	0
Sentence Attention Blocks for Answer Grounding	Sep 20, 2023	Question AnsweringSentence	—Unverified	0
Separate and Locate: Rethink the Text in Text-based Visual Question Answering	Aug 31, 2023	Optical Character Recognition (OCR)Position	CodeCode Available	0
Making the V in Text-VQA Matter	Aug 1, 2023	Optical Character Recognition (OCR)TextVQA	—Unverified	0

Show:10 25 50

← PrevPage 3 of 5Next →

No leaderboard results yet.