SOTAVerified

Visual Commonsense Reasoning

Papers

Showing 1-10 of 65 papers

Title | Status | Hype
Compositional Image-Text Matching and Retrieval by Grounding Entities | Code | 0
Generative Visual Commonsense Answering and Explaining with Generative Scene Graph Constructing | - | 0
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey | - | 0
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor | Code | 0
Improving Visual Commonsense in Language Models via Multiple Image Generation | Code | 1
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | - | 0
ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition | - | 0
Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language Models | Code | 2
Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR | - | 0
EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning | - | 0

No leaderboard results yet.