SOTAVerified

Zero-Shot Image Classification

Zero-shot image classification is a technique in computer vision where a model can classify images into categories that were not present during training. This is achieved by leveraging semantic information about the categories, such as textual descriptions or relationships between classes.

Papers

Showing 91100 of 111 papers

TitleStatusHype
LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models0
LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model0
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification0
MoDE: CLIP Data Experts via Clustering0
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification0
Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models0
PaLI: A Jointly-Scaled Multilingual Language-Image Model0
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining0
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training0
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation0
Show:102550
← PrevPage 10 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OpenClip H/14 (34B)(Laion2B)Top-1 accuracy30.01Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP (ViT B-32)Average Score56.64Unverified
#ModelMetricClaimedVerifiedStatus
1GLIP (Tiny A)Average Score11.4Unverified