SOTAVerified

Image-text Retrieval

Papers

Showing 4150 of 248 papers

TitleStatusHype
Eye-gaze Guided Multi-modal Alignment for Medical Representation LearningCode1
ESA: External Space Attention Aggregation for Image-Text RetrievalCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional UnderstandingCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
CoSMo: Content-Style Modulation for Image Retrieval With Text FeedbackCode1
A Survey of Medical Vision-and-Language Applications and Their TechniquesCode1
Equivariant Similarity for Vision-Language Foundation ModelsCode1
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language TransformersCode1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal MappingCode1
Show:102550
← PrevPage 5 of 25Next →

No leaderboard results yet.