
Visual Question Answering

MLLM Leaderboard

Papers


Showing 1331–1340 of 2177 papers

Title	Date	Tasks	Status	Hype
Jaeger: A Concatenation-Based Multi-Transformer VQA Model	Oct 11, 2023	Dimensionality Reduction, model	Unverified	0
Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023	Oct 10, 2023	Decoder, object-detection	Unverified	0
Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models	Oct 9, 2023	Hallucination, Object	Unverified	0
Causal Reasoning through Two Layers of Cognition for Improving Generalization in Visual Question Answering	Oct 9, 2023	Answer Generation, Question Answering	Unverified	0
Lightweight In-Context Tuning for Multimodal Unified Models	Oct 8, 2023	Image Captioning, In-Context Learning	Unverified	0
Improving Automatic VQA Evaluation Using Large Language Models	Oct 4, 2023	In-Context Learning, Question Answering	Unverified	0
On the Cognition of Visual Question Answering Models and Human Intelligence: A Comparative Study	Oct 4, 2023	Question Answering, Visual Question Answering	Unverified	0
SelfGraphVQA: A Self-Supervised Graph Neural Network for Scene-based Question Answering	Oct 3, 2023	Graph Neural Network, Question Answering	Unverified	0
Human Mobility Question Answering (Vision Paper)	Oct 2, 2023	Management, Question Answering	Unverified	0
Tackling VQA with Pretrained Foundation Models without Further Training	Sep 27, 2023	Question Answering, Visual Question Answering	Unverified	0


Page 134 of 218

Datasets: MM-Vet, ViP-Bench, VQA v2 test-dev, BenchLMM, MMBench, V*bench, VQA v2 val, MSRVTT-QA, VQA v2 test-std, MMHal-Bench, MSVD-QA, PlotQA-D1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MMCTAgent (GPT-4 + GPT-4V)	GPT-4 score	74.24	—	Unverified
2	Qwen2-VL-72B	GPT-4 score	74	—	Unverified
3	InternVL2.5-78B	GPT-4 score	72.3	—	Unverified
4	GPT-4o +text rationale +IoT	GPT-4 score	72.2	—	Unverified
5	Lyra-Pro	GPT-4 score	71.4	—	Unverified
6	GLM-4V-Plus	GPT-4 score	71.1	—	Unverified
7	Phantom-7B	GPT-4 score	70.8	—	Unverified
8	InternVL2.5-38B	GPT-4 score	68.8	—	Unverified
9	InternVL2-26B (SGP, token ratio 64%)	GPT-4 score	65.6	—	Unverified
10	Baichuan-Omni (7B)	GPT-4 score	65.4	—	Unverified