SOTAVerified

Zero-shot Image Retrieval

Papers

Showing 1–10 of 29 papers

| Title | Status | Hype |
| --- | --- | --- |
| Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Code | 5 |
| AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities | Code | 4 |
| Cross-lingual and Multilingual CLIP | Code | 2 |
| InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | Code | 1 |
| Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval | Code | 1 |
| FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing | Code | 1 |
| Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval | Code | 1 |
| General Image Descriptors for Open World Image Retrieval using ViT CLIP | Code | 1 |
| FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1 |
| CCMB: A Large-scale Chinese Cross-modal Benchmark | Code | 1 |

No leaderboard results yet.