SOTAVerified

Image Description

Papers

Showing 111120 of 154 papers

TitleStatusHype
Impressions: Understanding Visual Semiotics and Aesthetic Impact0
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments0
InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models0
LaMOuR: Leveraging Language Models for Out-of-Distribution Recovery in Reinforcement Learning0
Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
Beyond Part Models: Person Retrieval with Refined Part Pooling (and a Strong Convolutional Baseline)Code0
How Do Image Description Systems Describe People? A Targeted Assessment of System Competence in the PEOPLE-domainCode0
IDEA: Image Description Enhanced CLIP-AdapterCode0
Describing Videos by Exploiting Temporal StructureCode0
Show:102550
← PrevPage 12 of 16Next →

No leaderboard results yet.