SOTAVerified

Image-text Retrieval

Papers

Showing 151160 of 248 papers

TitleStatusHype
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction0
Enhancing Conceptual Understanding in Multimodal Contrastive Learning through Hard Negative Samples0
Embracing Language Inclusivity and Diversity in CLIP through Continual Language LearningCode0
Enhancing Image-Text Matching with Adaptive Feature AggregationCode0
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment0
Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data0
LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models0
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers0
A New Fine-grained Alignment Method for Image-text Matching0
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval0
Show:102550
← PrevPage 16 of 25Next →

No leaderboard results yet.