SOTAVerified

Visual Question Answering

MLLM Leaderboard

Papers

Showing 20762100 of 2177 papers

TitleStatusHype
`Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks0
KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration0
Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering0
Kernel Pooling for Convolutional Neural Networks0
Zero-Shot Anomaly Detection in Battery Thermal Images Using Visual Question Answering with Prior Knowledge0
Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models0
Knowing Where to Look? Analysis on Attention of Visual Question Answering System0
Unshuffling Data for Improved Generalization0
Knowledge Acquisition for Visual Question Answering via Iterative Querying0
Knowledge-Augmented Language Models Interpreting Structured Chest X-Ray Findings0
Knowledge-Based Counterfactual Queries for Visual Question Answering0
Knowledge-Based Visual Question Answering in Videos0
Knowledge Condensation and Reasoning for Knowledge-based VQA0
Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering0
Unshuffling Data for Improved Generalization in Visual Question Answering0
Fusion of Detected Objects in Text for Visual Question Answering0
FunBench: Benchmarking Fundus Reading Skills of MLLMs0
Answer-Type Prediction for Visual Question Answering0
KOSMOS-2.5: A Multimodal Literate Model0
From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics0
Unsupervised Keyword Extraction for Full-sentence VQA0
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA0
KVQA: Knowledge-Aware Visual Question Answering0
From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason0
From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question Answering0
Show:102550
← PrevPage 84 of 88Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MMCTAgent (GPT-4 + GPT-4V)GPT-4 score74.24Unverified
2Qwen2-VL-72BGPT-4 score74Unverified
3InternVL2.5-78BGPT-4 score72.3Unverified
4GPT-4o +text rationale +IoTGPT-4 score72.2Unverified
5Lyra-ProGPT-4 score71.4Unverified
6GLM-4V-PlusGPT-4 score71.1Unverified
7Phantom-7BGPT-4 score70.8Unverified
8InternVL2.5-38BGPT-4 score68.8Unverified
9InternVL2-26B (SGP, token ratio 64%)GPT-4 score65.6Unverified
10Baichuan-Omni (7B)GPT-4 score65.4Unverified