SOTAVerified

Visual Commonsense Reasoning

Papers

Showing 3140 of 65 papers

TitleStatusHype
ILLUME: Rationalizing Vision-Language Models through Human InteractionsCode0
TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning BaselinesCode0
TAB-VCR: Tags and Attributes based VCR BaselinesCode0
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks0
A survey on knowledge-enhanced multimodal learning0
Attention Mechanism based Cognition-level Scene Understanding0
Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images0
CAVL: Learning Contrastive and Adaptive Representations of Vision and Language0
CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks0
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.