SOTAVerified

Image-text Retrieval

Papers

Showing 5160 of 248 papers

TitleStatusHype
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language RepresentationsCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
ALIP: Adaptive Language-Image Pre-training with Synthetic CaptionCode1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text RetrievalCode1
AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive LearningCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
Equivariant Similarity for Vision-Language Foundation ModelsCode1
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding EvaluationCode1
Composing Object Relations and Attributes for Image-Text MatchingCode1
Benchmarking Robustness of Multimodal Image-Text Models under Distribution ShiftCode1
Show:102550
← PrevPage 6 of 25Next →

No leaderboard results yet.