SOTAVerified

Image-text Retrieval

Papers

Showing 7180 of 248 papers

TitleStatusHype
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement0
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language ModelsCode3
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
Eye-gaze Guided Multi-modal Alignment for Medical Representation LearningCode1
LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival0
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction0
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text RetrievalCode2
Enhancing Conceptual Understanding in Multimodal Contrastive Learning through Hard Negative Samples0
Embracing Language Inclusivity and Diversity in CLIP through Continual Language LearningCode0
Enhancing Image-Text Matching with Adaptive Feature AggregationCode0
Show:102550
← PrevPage 8 of 25Next →

No leaderboard results yet.