Multimodal Reasoning
Reasoning over multimodal inputs.
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4V | Accuracy | 24 | — | Unverified |
| 2 | Gemini Pro | Accuracy | 13.2 | — | Unverified |
| 3 | LLaVA-1.5-13B | Accuracy | 1.8 | — | Unverified |
| 4 | LLaVA-1.5-7B | Accuracy | 1.5 | — | Unverified |
| 5 | BLIP2-FLAN-T5-XXL | Accuracy | 0.9 | — | Unverified |
| 6 | QWEN | Accuracy | 0.9 | — | Unverified |
| 7 | CogVLM | Accuracy | 0.9 | — | Unverified |
| 8 | InstructBLIP | Accuracy | 0.6 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4V | Accuracy | 22.76 | — | Unverified |
| 2 | Gemini Pro | Accuracy | 17.66 | — | Unverified |
| 3 | Qwen-VL-Max | Accuracy | 15.59 | — | Unverified |
| 4 | InternLM-XComposer2-VL | Accuracy | 14.54 | — | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 | Accuracy | 30.3 | — | Unverified |