SOTAVerified

Visual Question Answering

MLLM Leaderboard

Papers

Showing 751775 of 2177 papers

TitleStatusHype
Bayesian Attention Belief Networks0
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark0
BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering0
C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset0
Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks0
Barriers in Integrating Medical Visual Question Answering into Radiology Workflows: A Scoping Review and Clinicians' Insights0
Curriculum Script Distillation for Multilingual Visual Question Answering0
A Causal Approach to Mitigate Modality Preference Bias in Medical Visual Question Answering0
Curriculum Learning for Compositional Visual Reasoning0
Curriculum Learning Effectively Improves Low Data VQA0
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation0
Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering0
Dynamic Clue Bottlenecks: Towards Interpretable-by-Design Visual Question Answering0
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning0
Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses0
CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering0
CS-VQA: Visual Question Answering with Compressively Sensed Images0
Balancing Performance and Efficiency in Zero-shot Robotic Navigation0
CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization0
Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models0
Cross-Modal Retrieval Augmentation for Multi-Modal Classification0
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs0
An Empirical Evaluation of Visual Question Answering for Novel Objects0
Interpretable Counting for Visual Question Answering0
Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering0
Show:102550
← PrevPage 31 of 88Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MMCTAgent (GPT-4 + GPT-4V)GPT-4 score74.24Unverified
2Qwen2-VL-72BGPT-4 score74Unverified
3InternVL2.5-78BGPT-4 score72.3Unverified
4GPT-4o +text rationale +IoTGPT-4 score72.2Unverified
5Lyra-ProGPT-4 score71.4Unverified
6GLM-4V-PlusGPT-4 score71.1Unverified
7Phantom-7BGPT-4 score70.8Unverified
8InternVL2.5-38BGPT-4 score68.8Unverified
9InternVL2-26B (SGP, token ratio 64%)GPT-4 score65.6Unverified
10Baichuan-Omni (7B)GPT-4 score65.4Unverified