SOTAVerified

Image-text Retrieval

Papers

Showing 31-40 of 248 papers

Title | Status | Hype
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Code | 1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Code | 1
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment | Code | 1
ESA: External Space Attention Aggregation for Image-Text Retrieval | Code | 1
FILIP: Fine-grained Interactive Language-Image Pre-Training | Code | 1
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation | Code | 1
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Code | 1
Dynamic Modality Interaction Modeling for Image-Text Retrieval | Code | 1
A Survey of Medical Vision-and-Language Applications and Their Techniques | Code | 1
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training | Code | 1

No leaderboard results yet.