SOTAVerified

Zero-shot Image Retrieval

Papers

Showing 1120 of 29 papers

TitleStatusHype
Chinese CLIP: Contrastive Vision-Language Pretraining in ChineseCode5
General Image Descriptors for Open World Image Retrieval using ViT CLIPCode1
ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-trainingCode0
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
Curriculum Learning for Data-Efficient Vision-Language Alignment0
Cross-lingual and Multilingual CLIPCode2
CCMB: A Large-scale Chinese Cross-modal BenchmarkCode1
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training BenchmarkCode0
Visual Representation Learning with Self-Supervised Attention for Low-Label High-data RegimeCode0
FLAVA: A Foundational Language And Vision Alignment ModelCode1
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.