Visual Reasoning
The ability to understand the actions depicted in visual images and to reason about them.
Papers
Showing 1–10 of 698 papers
Datasets: Winoground; NLVR2 Dev; NLVR2 Test; CLEVRER; Bongard-OpenWorld; WinoGAViL; VSR; PHYRE-1B-Cross; PHYRE-1B-Within; VASR; IRFL: Image Recognition of Figurative Language; NLVR
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4o + CA | Text Score | 75.5 | — | Unverified |
| 2 | GPT-4V (CoT, pick b/w two options) | Text Score | 75.25 | — | Unverified |
| 3 | GPT-4V (pick b/w two options) | Text Score | 69.25 | — | Unverified |
| 4 | MMICL + CoCoT | Text Score | 64.25 | — | Unverified |
| 5 | GPT-4V + CoCoT | Text Score | 58.5 | — | Unverified |
| 6 | OpenFlamingo + CoCoT | Text Score | 58.25 | — | Unverified |
| 7 | GPT-4V | Text Score | 54.5 | — | Unverified |
| 8 | FIBER (EqSim) | Text Score | 51.5 | — | Unverified |
| 9 | FIBER (finetuned, Flickr30k) | Text Score | 51.25 | — | Unverified |
| 10 | MMICL + CCoT | Text Score | 51 | — | Unverified |