SOTAVerified

Zero-Shot Image Classification

Zero-shot image classification is a technique in computer vision where a model can classify images into categories that were not present during training. This is achieved by leveraging semantic information about the categories, such as textual descriptions or relationships between classes.

Papers

Showing 6170 of 111 papers

TitleStatusHype
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations0
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision0
Altogether: Image Captioning via Re-aligning Alt-text0
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene0
BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models0
Bayesian Test-Time Adaptation for Vision-Language Models0
Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation0
Bridge the Modality and Capability Gaps in Vision-Language Model Selection0
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization0
CLAMP: Contrastive LAnguage Model Prompt-tuning0
Show:102550
← PrevPage 7 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OpenClip H/14 (34B)(Laion2B)Top-1 accuracy30.01Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP (ViT B-32)Average Score56.64Unverified
#ModelMetricClaimedVerifiedStatus
1GLIP (Tiny A)Average Score11.4Unverified