Visual Reasoning
Ability to understand actions and reasoning associated with any visual images
Papers
Showing 1–10 of 698 papers
All datasetsWinogroundNLVR2 DevNLVR2 TestCLEVRERBongard-OpenWorldWinoGAViLVSRPHYRE-1B-CrossPHYRE-1B-WithinVASRIRFL: Image Recognition of Figurative LanguageNLVR
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BEiT-3 | Accuracy | 91.51 | — | Unverified |
| 2 | X2-VLM (large) | Accuracy | 88.7 | — | Unverified |
| 3 | XFM (base) | Accuracy | 87.6 | — | Unverified |
| 4 | X2-VLM (base) | Accuracy | 86.2 | — | Unverified |
| 5 | CoCa | Accuracy | 86.1 | — | Unverified |
| 6 | VLMo | Accuracy | 85.64 | — | Unverified |
| 7 | VK-OOD | Accuracy | 84.6 | — | Unverified |
| 8 | SimVLM | Accuracy | 84.53 | — | Unverified |
| 9 | X-VLM (base) | Accuracy | 84.41 | — | Unverified |
| 10 | VK-OOD | Accuracy | 83.9 | — | Unverified |