SOTAVerified

Image Description

Papers

Showing 41–50 of 154 papers

Title | Status | Hype
Describing Videos by Exploiting Temporal Structure | Code | 0
Bridging Languages through Images with Deep Partial Canonical Correlation Analysis | Code | 0
Improving Visual-Semantic Embeddings by Learning Semantically-Enhanced Hard Negatives for Cross-modal Information Retrieval | Code | 0
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs | Code | 0
Deep Imbalanced Attribute Classification using Visual Attention Aggregation | Code | 0
Does Multimodality Help Human and Machine for Translation and Image Captioning? | Code | 0
Bounding and Filling: A Fast and Flexible Framework for Image Captioning | Code | 0
IDEA: Image Description Enhanced CLIP-Adapter | Code | 0
Efficient Decentralized Visual Place Recognition From Full-Image Descriptors | Code | 0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze | Code | 0
Page 5 of 16

No leaderboard results yet.