SOTAVerified

Visual Question Answering

MLLM Leaderboard

Papers

Showing 20812090 of 2177 papers

TitleStatusHype
Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models0
Knowing Where to Look? Analysis on Attention of Visual Question Answering System0
Unshuffling Data for Improved Generalization0
Knowledge Acquisition for Visual Question Answering via Iterative Querying0
Knowledge-Augmented Language Models Interpreting Structured Chest X-Ray Findings0
Knowledge-Based Counterfactual Queries for Visual Question Answering0
Knowledge-Based Visual Question Answering in Videos0
Knowledge Condensation and Reasoning for Knowledge-based VQA0
Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering0
Unshuffling Data for Improved Generalization in Visual Question Answering0
Show:102550
← PrevPage 209 of 218Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MMCTAgent (GPT-4 + GPT-4V)GPT-4 score74.24Unverified
2Qwen2-VL-72BGPT-4 score74Unverified
3InternVL2.5-78BGPT-4 score72.3Unverified
4GPT-4o +text rationale +IoTGPT-4 score72.2Unverified
5Lyra-ProGPT-4 score71.4Unverified
6GLM-4V-PlusGPT-4 score71.1Unverified
7Phantom-7BGPT-4 score70.8Unverified
8InternVL2.5-38BGPT-4 score68.8Unverified
9InternVL2-26B (SGP, token ratio 64%)GPT-4 score65.6Unverified
10Baichuan-Omni (7B)GPT-4 score65.4Unverified