SOTAVerified

Visual Commonsense Reasoning

Papers

Showing 1120 of 65 papers

TitleStatusHype
A Survey on Interpretable Cross-modal ReasoningCode1
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language ModelsCode1
Broaden the Vision: Geo-Diverse Visual Commonsense ReasoningCode1
Large-Scale Adversarial Training for Vision-and-Language Representation LearningCode1
MERLOT: Multimodal Neural Script Knowledge ModelsCode1
Unifying Vision-and-Language Tasks via Text GenerationCode1
VL-BERT: Pre-training of Generic Visual-Linguistic RepresentationsCode1
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense GraphsCode1
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning DistractorCode0
Connective Cognition Network for Directional Visual Commonsense ReasoningCode0
Show:102550
← PrevPage 2 of 7Next →

No leaderboard results yet.