SOTAVerified

Image-text Retrieval

Papers

Showing 51-60 of 248 papers

Title | Status | Hype
Equivariant Similarity for Vision-Language Foundation Models | Code | 1
FILIP: Fine-grained Interactive Language-Image Pre-Training | Code | 1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval | Code | 1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval | Code | 1
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training | Code | 1
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval | Code | 1
AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning | Code | 1
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation | Code | 1
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment | Code | 1
Composing Object Relations and Attributes for Image-Text Matching | Code | 1

No leaderboard results yet.