SOTAVerified|Agents Browse Leaderboard About Blog

Visual Commonsense Reasoning

Image source: Visual Commonsense Reasoning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 65 papers

Title	Date	Tasks	Status	Hype
All in One: Exploring Unified Video-Language Pre-training	Mar 14, 2022	AllLanguage Modelling	CodeCode Available	2
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering	Sep 20, 2022	Multimodal Deep LearningMultimodal Reasoning	CodeCode Available	2
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest	Jul 7, 2023	AttributeCommon Sense Reasoning	CodeCode Available	2
Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language Models	Jun 3, 2024	Image CaptioningLanguage Modelling	CodeCode Available	2
A Survey on Interpretable Cross-modal Reasoning	Sep 5, 2023	Cross-Modal RetrievalDecision Making	CodeCode Available	1
Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning	Sep 14, 2021	Cultural Vocal Bursts Intensity PredictionVisual Commonsense Reasoning	CodeCode Available	1
Improving Visual Commonsense in Language Models via Multiple Image Generation	Jun 19, 2024	Common Sense ReasoningImage Generation	CodeCode Available	1
Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning	Jan 1, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Large-Scale Adversarial Training for Vision-and-Language Representation Learning	Jun 11, 2020	Image-text RetrievalQuestion Answering	CodeCode Available	1
MERLOT: Multimodal Neural Script Knowledge Models	Jun 4, 2021	Multimodal ReasoningVisual Commonsense Reasoning	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 7Next →

No leaderboard results yet.