SOTAVerified

Image-text Retrieval

Papers

Showing 161-170 of 248 papers

Title | Status | Hype
Learning to embed semantic similarity for joint image-text retrieval | | 0
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships | | 0
LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models | | 0
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning | | 0
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models | | 0
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval | | 0
LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival | | 0
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning | | 0
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval | | 0
MASS: Overcoming Language Bias in Image-Text Matching | | 0

No leaderboard results yet.