Visual Reasoning
The ability to understand the actions depicted in visual images and to reason about them
Datasets: Winoground, NLVR2 Dev, NLVR2 Test, CLEVRER, Bongard-OpenWorld, WinoGAViL, VSR, PHYRE-1B-Cross, PHYRE-1B-Within, VASR, IRFL: Image Recognition of Figurative Language, NLVR
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Gemini-2.0 + CA | 2-Class Accuracy | 93.6 | — | Unverified |
| 2 | GPT-4o + CA | 2-Class Accuracy | 92.8 | — | Unverified |
| 3 | Human | 2-Class Accuracy | 91 | — | Unverified |
| 4 | SNAIL | 2-Class Accuracy | 64 | — | Unverified |
| 5 | InstructBLIP + GPT-4 | 2-Class Accuracy | 63.8 | — | Unverified |
| 6 | BLIP-2 + ChatGPT (Fine-tuned) | 2-Class Accuracy | 63.3 | — | Unverified |
| 7 | InstructBLIP + ChatGPT + Neuro-Symbolic | 2-Class Accuracy | 55.5 | — | Unverified |
| 8 | ChatCaptioner + ChatGPT | 2-Class Accuracy | 49.3 | — | Unverified |
| 9 | Otter | 2-Class Accuracy | 49.3 | — | Unverified |
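
The table reports 2-Class Accuracy without defining it. As a minimal sketch, assuming the metric is the percentage of binary (two-class) predictions that match the ground-truth labels, it could be computed as below. The function name `two_class_accuracy` and the toy data are hypothetical and are not taken from any of the listed benchmarks.

```python
from typing import Sequence


def two_class_accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Return the percentage of binary predictions that match the labels.

    Assumption: each prediction and label is 0 or 1, and the score is
    reported on a 0-100 scale like the values in the table.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("at least one example is required")
    # Count positions where the predicted class equals the true class.
    correct = sum(int(p == y) for p, y in zip(predictions, labels))
    return 100.0 * correct / len(labels)


# Hypothetical toy data: 1 = positive class, 0 = negative class.
toy_predictions = [1, 0, 1, 1, 0, 0, 1, 0]
toy_labels = [1, 0, 0, 1, 0, 1, 1, 0]
toy_score = two_class_accuracy(toy_predictions, toy_labels)
print(f"2-class accuracy: {toy_score:.1f}")
```

Under this reading, and only if the two classes are balanced, random guessing would score about 50, which gives a rough reference point for the lower rows of the table.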