SOTAVerified

Image Description

Papers

Showing 2130 of 154 papers

TitleStatusHype
WIDIn: Wording Image for Domain-Invariant Representation in Single-Source Domain Generalization0
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal DatasetsCode0
Artwork Explanation in Large-scale Vision Language Models0
A Cognitive Evaluation Benchmark of Image Reasoning and Description for Large Vision-Language Models0
Can Large Multimodal Models Uncover Deep Semantics Behind Images?Code1
Seeing the Unseen: Visual Common Sense for Semantic Placement0
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models0
Localized Symbolic Knowledge Distillation for Visual Commonsense ModelsCode0
Impressions: Understanding Visual Semiotics and Aesthetic Impact0
Large Language Models can Share Images, Too!Code0
Show:102550
← PrevPage 3 of 16Next →

No leaderboard results yet.