Visual Question Answering

MLLM Leaderboard

Papers

Showing 1951–1960 of 2177 papers

Title	Date	Tasks	Status	Hype
Query and Attention Augmentation for Knowledge-Based Explainable Reasoning	Jan 1, 2022	Question Answering, Visual Question Answering	Code Available	0
Dynamic Memory Networks for Visual and Textual Question Answering	Mar 4, 2016	Question Answering, Visual Question Answering	Code Available	0
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking	Apr 18, 2022	cross-modal alignment, Document AI	Code Available	0
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding	Dec 29, 2020	Document Image Classification, Document Layout Analysis	Code Available	0
An Improved Attention for Visual Question Answering	Nov 4, 2020	Decoder, Question Answering	Code Available	0
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models	Aug 7, 2023	backdoor defense, object-detection	Code Available	0
TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language Models	May 21, 2025	Human Aging, Question Answering	Code Available	0
Adaptively Clustering Neighbor Elements for Image-Text Generation	Jan 5, 2023	Clustering, Decoder	Code Available	0
Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering	Mar 6, 2022	Graph Attention, Question Answering	Code Available	0
DVQA: Understanding Data Visualizations via Question Answering	Jan 24, 2018	Articles, Chart Question Answering	Code Available	0

Datasets: MM-Vet, ViP-Bench, VQA v2 test-dev, BenchLMM, MMBench, V*bench, VQA v2 val, MSRVTT-QA, VQA v2 test-std, MMHal-Bench, MSVD-QA, PlotQA-D1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MMCTAgent (GPT-4 + GPT-4V)	GPT-4 score	74.24	—	Unverified
2	Qwen2-VL-72B	GPT-4 score	74	—	Unverified
3	InternVL2.5-78B	GPT-4 score	72.3	—	Unverified
4	GPT-4o +text rationale +IoT	GPT-4 score	72.2	—	Unverified
5	Lyra-Pro	GPT-4 score	71.4	—	Unverified
6	GLM-4V-Plus	GPT-4 score	71.1	—	Unverified
7	Phantom-7B	GPT-4 score	70.8	—	Unverified
8	InternVL2.5-38B	GPT-4 score	68.8	—	Unverified
9	InternVL2-26B (SGP, token ratio 64%)	GPT-4 score	65.6	—	Unverified
10	Baichuan-Omni (7B)	GPT-4 score	65.4	—	Unverified