SOTAVerified

Visual Commonsense Reasoning

Papers

Showing 2130 of 65 papers

TitleStatusHype
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning0
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey0
GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions0
Learning to Agree on Vision Attention for Visual Commonsense Reasoning0
MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound0
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images0
Generative Visual Commonsense Answering and Explaining with Generative Scene Graph Constructing0
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?0
InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining0
Attention Mechanism based Cognition-level Scene Understanding0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.