SOTAVerified

Visual Commonsense Reasoning

Papers

Showing 1120 of 65 papers

TitleStatusHype
A Survey on Interpretable Cross-modal ReasoningCode1
MERLOT: Multimodal Neural Script Knowledge ModelsCode1
Broaden the Vision: Geo-Diverse Visual Commonsense ReasoningCode1
Towards artificial general intelligence via a multimodal foundation modelCode1
Improving Visual Commonsense in Language Models via Multiple Image GenerationCode1
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense GraphsCode1
Large-Scale Adversarial Training for Vision-and-Language Representation LearningCode1
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language TasksCode1
Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR0
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning0
Show:102550
← PrevPage 2 of 7Next →

No leaderboard results yet.