Visual Reasoning
Ability to understand actions and reasoning associated with any visual images
Papers
Showing 1–10 of 698 papers
All datasetsWinogroundNLVR2 DevNLVR2 TestCLEVRERBongard-OpenWorldWinoGAViLVSRPHYRE-1B-CrossPHYRE-1B-WithinVASRIRFL: Image Recognition of Figurative LanguageNLVR
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LXMERT | accuracy | 70.1 | — | Unverified |
| 2 | ViLT | accuracy | 69.3 | — | Unverified |
| 3 | CLIP (finetuned) | accuracy | 65.1 | — | Unverified |
| 4 | CLIP (frozen) | accuracy | 56 | — | Unverified |
| 5 | VisualBERT | accuracy | 55.2 | — | Unverified |