SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Visual Commonsense Reasoning
Visual Commonsense Reasoning
Image source:
Visual Commonsense Reasoning
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 21–30 of 65 papers
Title
Date
Tasks
Status
Hype
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey
Dec 11, 2024
Image Captioning
Question Answering
—
Unverified
0
Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor
Dec 8, 2024
Misconceptions
Multiple-choice
Code
Code Available
0
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Jun 11, 2024
Adversarial Text
Image Generation
—
Unverified
0
ALGO: Object-Grounded Visual Commonsense Reasoning for Open-World Egocentric Action Recognition
Jun 9, 2024
Action Recognition
Object Recognition
—
Unverified
0
Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
May 27, 2024
Question Answering
TAG
—
Unverified
0
EventLens: Leveraging Event-Aware Pretraining and Cross-modal Linking Enhances Visual Commonsense Reasoning
Apr 22, 2024
Visual Commonsense Reasoning
—
Unverified
0
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Dec 1, 2023
Visual Commonsense Reasoning
Visual Prompting
Code
Code Available
0
Improving Vision-and-Language Reasoning via Spatial Relations Modeling
Nov 9, 2023
Position regression
Relation
—
Unverified
0
ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
Oct 9, 2023
Image Captioning
Visual Commonsense Reasoning
—
Unverified
0
Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning
May 26, 2023
Object Recognition
Visual Commonsense Reasoning
—
Unverified
0
Show:
10
25
50
← Prev
Page 3 of 7
Next →
No leaderboard results yet.