Visual Reasoning
The ability to understand the actions and reasoning associated with visual images.
Datasets: Winoground, NLVR2 Dev, NLVR2 Test, CLEVRER, Bongard-OpenWorld, WinoGAViL, VSR, PHYRE-1B-Cross, PHYRE-1B-Within, VASR, IRFL: Image Recognition of Figurative Language, NLVR
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Humans | Jaccard Index | 90 | — | Unverified |
| 2 | ViLT (Zero-Shot) | Jaccard Index | 52 | — | Unverified |
| 3 | X-VLM (Zero-Shot) | Jaccard Index | 46 | — | Unverified |
| 4 | CLIP-ViT-B/32 (Zero-Shot) | Jaccard Index | 41 | — | Unverified |
| 5 | CLIP-ViT-L/14 (Zero-Shot) | Jaccard Index | 40 | — | Unverified |
| 6 | CLIP-RN50x64/14 (Zero-Shot) | Jaccard Index | 38 | — | Unverified |
| 7 | CLIP-RN50 (Zero-Shot) | Jaccard Index | 35 | — | Unverified |
| 8 | CLIP-ViL (Zero-Shot) | Jaccard Index | 15 | — | Unverified |
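
The metric in the table, the Jaccard index, is the size of the intersection of two sets divided by the size of their union. The sketch below shows that calculation under a few assumptions that are not stated in the source: predictions and ground truth are treated as sets of selected item identifiers, the function name `jaccard_index` and the `img_*` identifiers are hypothetical, and the score is multiplied by 100 because the table values appear to be on a 0 to 100 scale. It is an illustration of the metric, not the benchmark's official scoring code.

```python
def jaccard_index(predicted, expected):
    """Return |A ∩ B| / |A ∪ B| for two collections, as a value in [0, 1]."""
    a, b = set(predicted), set(expected)
    if not a and not b:
        # Assumed convention: two empty selections count as identical.
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical example: a model selects three candidates, two of which are correct.
predicted = {"img_1", "img_2", "img_3"}
expected = {"img_2", "img_3", "img_4"}

score = jaccard_index(predicted, expected)
print(f"{100 * score:.0f}")  # 2 shared items / 4 distinct items -> prints 50
```

A per-benchmark score would typically be the mean of this value over all test instances, although the exact aggregation used for the numbers above is not given here.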