SOTAVerified

Image-text Retrieval

Papers

Showing 191-200 of 248 papers

Title | Status | Hype
SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI | | 0
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | | 0
Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval | | 0
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval | | 0
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input | | 0
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment | | 0
The style transformer with common knowledge optimization for image-text retrieval | | 0
TSVC: Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval | | 0
UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training | | 0
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning | | 0
Page 20 of 25

No leaderboard results yet.