SOTAVerified

Zero-Shot Image Classification

Zero-shot image classification is a technique in computer vision where a model can classify images into categories that were not present during training. This is achieved by leveraging semantic information about the categories, such as textual descriptions or relationships between classes.

Papers

Showing 1120 of 111 papers

TitleStatusHype
RemoteCLIP: A Vision Language Foundation Model for Remote SensingCode2
What does a platypus look like? Generating customized prompts for zero-shot image classificationCode2
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text SupervisionCode2
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer TextCode1
Post-hoc Probabilistic Vision-Language ModelsCode1
TaxaBind: A Unified Embedding Space for Ecological ApplicationsCode1
Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual KnowledgeCode1
Mind's Eye: Image Recognition by EEG via Multimodal Similarity-Keeping Contrastive LearningCode1
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive LearningCode1
Learn "No" to Say "Yes" Better: Improving Vision-Language Models via NegationsCode1
Show:102550
← PrevPage 2 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OpenClip H/14 (34B)(Laion2B)Top-1 accuracy30.01Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP (ViT B-32)Average Score56.64Unverified
#ModelMetricClaimedVerifiedStatus
1GLIP (Tiny A)Average Score11.4Unverified