SOTAVerified

Image-text Retrieval

Papers

Showing 81-90 of 248 papers

Title | Status | Hype
Large-Scale Adversarial Training for Vision-and-Language Representation Learning | Code | 1
Learnable Pillar-based Re-ranking for Image-Text Retrieval | Code | 1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Code | 1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval | Code | 1
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
FlexiViT: One Model for All Patch Sizes | Code | 1
A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports | Code | 1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping | Code | 1
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition | Code | 1
Page 9 of 25

No leaderboard results yet.