SOTAVerified|Agents Browse Leaderboard About Blog

Visual Question Answering

MLLM Leaderboard

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1651–1660 of 2177 papers

Title	Date	Tasks	Status	Hype
'Just because you are right, doesn't mean I am wrong': Overcoming a Bottleneck in the Development and Evaluation of Open-Ended Visual Question Answering (VQA) Tasks	Mar 28, 2021	Question AnsweringVisual Question Answering	CodeCode Available	0
Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models	Mar 26, 2021	Question AnsweringVisual Question Answering	—Unverified	0
Visual Grounding Strategies for Text-Only Natural Language Processing	Mar 25, 2021	Image RetrievalLanguage Modeling	—Unverified	0
Multi-Modal Answer Validation for Knowledge-Based VQA	Mar 23, 2021	Question AnsweringRetrieval	CodeCode Available	1
How to Design Sample and Computationally Efficient VQA Models	Mar 22, 2021	Question AnsweringVisual Question Answering	—Unverified	0
A Comprehensive Survey of Scene Graphs: Generation and Application	Mar 17, 2021	Image CaptioningQuestion Answering	—Unverified	0
Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA	Mar 17, 2021	Question AnsweringRelational Reasoning	CodeCode Available	0
Characterizing Misclassifications of Deep NLP Models	Mar 12, 2021	named-entity-recognitionNamed Entity Recognition	—Unverified	0
RL-CSDia: Representation Learning of Computer Science Diagrams	Mar 10, 2021	Question AnsweringRepresentation Learning	—Unverified	0
Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering	Mar 9, 2021	Optical Character Recognition (OCR)Question Answering	CodeCode Available	0

Show:10 25 50

← PrevPage 166 of 218Next →

All datasets MM-Vet ViP-Bench VQA v2 test-dev BenchLMM MMBench V*bench VQA v2 val MSRVTT-QA VQA v2 test-std MMHal-Bench MSVD-QA PlotQA-D1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MMCTAgent (GPT-4 + GPT-4V)	GPT-4 score	74.24	—	Unverified
2	Qwen2-VL-72B	GPT-4 score	74	—	Unverified
3	InternVL2.5-78B	GPT-4 score	72.3	—	Unverified
4	GPT-4o +text rationale +IoT	GPT-4 score	72.2	—	Unverified
5	Lyra-Pro	GPT-4 score	71.4	—	Unverified
6	GLM-4V-Plus	GPT-4 score	71.1	—	Unverified
7	Phantom-7B	GPT-4 score	70.8	—	Unverified
8	InternVL2.5-38B	GPT-4 score	68.8	—	Unverified
9	InternVL2-26B (SGP, token ratio 64%)	GPT-4 score	65.6	—	Unverified
10	Baichuan-Omni (7B)	GPT-4 score	65.4	—	Unverified