SOTAVerified

Visual Question Answering

MLLM Leaderboard

Papers

Showing 531–540 of 2177 papers

Title | Status | Hype
Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering | Code | 1
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing | Code | 1
Predicting Human Scanpaths in Visual Question Answering | Code | 1
RSTNet: Captioning With Adaptive Attention on Visual and Non-Visual Words | Code | 1
Probing Image-Language Transformers for Verb Understanding | Code | 1
Check It Again: Progressive Visual Question Answering via Visual Entailment | Code | 1
Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training | Code | 1
Multiple Meta-model Quantifying for Medical Visual Question Answering | Code | 1
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules | Code | 1
Passage Retrieval for Outside-Knowledge Visual Question Answering | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MMCTAgent (GPT-4 + GPT-4V) | GPT-4 score | 74.24 | - | Unverified
2 | Qwen2-VL-72B | GPT-4 score | 74 | - | Unverified
3 | InternVL2.5-78B | GPT-4 score | 72.3 | - | Unverified
4 | GPT-4o +text rationale +IoT | GPT-4 score | 72.2 | - | Unverified
5 | Lyra-Pro | GPT-4 score | 71.4 | - | Unverified
6 | GLM-4V-Plus | GPT-4 score | 71.1 | - | Unverified
7 | Phantom-7B | GPT-4 score | 70.8 | - | Unverified
8 | InternVL2.5-38B | GPT-4 score | 68.8 | - | Unverified
9 | InternVL2-26B (SGP, token ratio 64%) | GPT-4 score | 65.6 | - | Unverified
10 | Baichuan-Omni (7B) | GPT-4 score | 65.4 | - | Unverified