SOTAVerified

Image-text Retrieval

Papers

Showing 101110 of 248 papers

TitleStatusHype
Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models0
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals0
AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models0
Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation0
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions0
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning0
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval0
Knowledge Transfer Across Modalities with Natural Language Supervision0
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?0
HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval0
Show:102550
← PrevPage 11 of 25Next →

No leaderboard results yet.