SOTAVerified

Visual Commonsense Reasoning

Papers

Showing 3140 of 65 papers

TitleStatusHype
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions0
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey0
Improving Vision-and-Language Reasoning via Spatial Relations Modeling0
InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining0
KVL-BERT: Knowledge Enhanced Visual-and-Linguistic BERT for Visual Commonsense Reasoning0
Learning to Agree on Vision Attention for Visual Commonsense Reasoning0
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound0
ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition0
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization0
Playing Lottery Tickets with Vision and Language0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.