SOTAVerified

Image-text Retrieval

Papers

Showing 191-200 of 248 papers

| Title | Status | Hype |
| --- | --- | --- |
| Knowledge Transfer Across Modalities with Natural Language Supervision | | 0 |
| Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm | | 0 |
| Learning to embed semantic similarity for joint image-text retrieval | | 0 |
| Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships | | 0 |
| LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models | | 0 |
| LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning | | 0 |
| Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models | | 0 |
| LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval | | 0 |
| LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival | | 0 |
| MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning | | 0 |

No leaderboard results yet.