SOTAVerified

Zero-shot Image Retrieval

Papers

Showing 1–10 of 29 papers

| Title | Status | Hype |
|---|---|---|
| Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models | Code | 0 |
| CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance | | 0 |
| Piecewise-Linear Manifolds for Deep Metric Learning | | 0 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Code | 0 |
| InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | Code | 1 |
| Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval | Code | 1 |
| GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training | | 0 |
| FACTUAL: A Benchmark for Faithful and Consistent Textual Scene Graph Parsing | Code | 1 |
| Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval | Code | 1 |
| AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities | Code | 4 |

No leaderboard results yet.