SOTAVerified

Zero-shot Text-to-Image Retrieval

Papers

Showing 1115 of 15 papers

TitleStatusHype
ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-trainingCode0
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset0
FLAVA: A Foundational Language And Vision Alignment ModelCode1
Learning Transferable Visual Models From Natural Language SupervisionCode2
ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual DescriptionsCode0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.