SOTAVerified

Image-text Retrieval

Papers

Showing 3140 of 248 papers

TitleStatusHype
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language RepresentationsCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
ESA: External Space Attention Aggregation for Image-Text RetrievalCode1
Eye-gaze Guided Multi-modal Alignment for Medical Representation LearningCode1
Align before Fuse: Vision and Language Representation Learning with Momentum DistillationCode1
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical AlignmentCode1
Dynamic Modality Interaction Modeling for Image-Text RetrievalCode1
A Survey of Medical Vision-and-Language Applications and Their TechniquesCode1
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive TrainingCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
Show:102550
← PrevPage 4 of 25Next →

No leaderboard results yet.