SOTAVerified|Agents Browse Leaderboard About

Visual Question Answering

MLLM Leaderboard

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 661–670 of 2177 papers

Title	Date	Tasks	Status	Hype
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications	Jun 12, 2024	document-image-classificationDocument Image Classification	—Unverified	0
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning	Jun 11, 2024	BenchmarkingContrastive Learning	CodeCode Available	0
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent	Jun 11, 2024	AI AgentDescriptive	CodeCode Available	2
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark	Jun 10, 2024	DiversityQuestion Answering	—Unverified	0
Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024	Jun 10, 2024	Language Modellingobject-detection	—Unverified	0
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text	Jun 10, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
Towards Semantic Equivalence of Tokenization in Multimodal LLM	Jun 7, 2024	Visual Question Answering	—Unverified	0
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation	Jun 6, 2024	Common Sense ReasoningMamba	—Unverified	0
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs	Jun 6, 2024	Language ModellingLarge Language Model	—Unverified	0
Understanding Information Storage and Transfer in Multi-modal Large Language Models	Jun 6, 2024	Factual Visual Question AnsweringModel Editing	—Unverified	0

Show:10 25 50

← PrevPage 67 of 218Next →

All datasets MM-Vet ViP-Bench VQA v2 test-dev BenchLMM MMBench V*bench VQA v2 val MSRVTT-QA VQA v2 test-std MMHal-Bench MSVD-QA PlotQA-D1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MMCTAgent (GPT-4 + GPT-4V)	GPT-4 score	74.24	—	Unverified
2	Qwen2-VL-72B	GPT-4 score	74	—	Unverified
3	InternVL2.5-78B	GPT-4 score	72.3	—	Unverified
4	GPT-4o +text rationale +IoT	GPT-4 score	72.2	—	Unverified
5	Lyra-Pro	GPT-4 score	71.4	—	Unverified
6	GLM-4V-Plus	GPT-4 score	71.1	—	Unverified
7	Phantom-7B	GPT-4 score	70.8	—	Unverified
8	InternVL2.5-38B	GPT-4 score	68.8	—	Unverified
9	InternVL2-26B (SGP, token ratio 64%)	GPT-4 score	65.6	—	Unverified
10	Baichuan-Omni (7B)	GPT-4 score	65.4	—	Unverified