Visual Question Answering
MLLM Leaderboard
Papers
Showing 51–60 of 2177 papers
All datasetsMM-VetViP-BenchVQA v2 test-devBenchLMMMMBenchV*benchVQA v2 valMSRVTT-QAVQA v2 test-stdMMHal-BenchMSVD-QAPlotQA-D1
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MMCTAgent (GPT-4 + GPT-4V) | GPT-4 score | 74.24 | — | Unverified |
| 2 | Qwen2-VL-72B | GPT-4 score | 74 | — | Unverified |
| 3 | InternVL2.5-78B | GPT-4 score | 72.3 | — | Unverified |
| 4 | GPT-4o +text rationale +IoT | GPT-4 score | 72.2 | — | Unverified |
| 5 | Lyra-Pro | GPT-4 score | 71.4 | — | Unverified |
| 6 | GLM-4V-Plus | GPT-4 score | 71.1 | — | Unverified |
| 7 | Phantom-7B | GPT-4 score | 70.8 | — | Unverified |
| 8 | InternVL2.5-38B | GPT-4 score | 68.8 | — | Unverified |
| 9 | InternVL2-26B (SGP, token ratio 64%) | GPT-4 score | 65.6 | — | Unverified |
| 10 | Baichuan-Omni (7B) | GPT-4 score | 65.4 | — | Unverified |