Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to recognize classes it never saw during training. The defining condition is that no labeled examples of these classes are available during supervised learning.

Earlier work in zero-shot learning uses attributes in a two-step approach to infer unknown classes. In computer vision, more recent approaches learn mappings from image feature space to semantic space, while others learn non-linear multimodal embeddings. In modern NLP, language models can be evaluated on downstream tasks without fine-tuning.
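The two-step attribute approach can be illustrated with a small sketch: first predict attribute scores from image features, then pick the candidate class whose attribute signature matches best. Everything here is an assumption made for illustration: the class names, the attribute signatures, the hand-set `WEIGHTS` matrix (standing in for an attribute predictor that would normally be trained on seen classes), and the choice of cosine similarity for matching. It is not the method of any particular paper listed on this page.

```python
import math

# Hypothetical attribute signatures (illustrative values, not taken from any dataset).
# Attribute order: [has_stripes, has_hooves, lives_in_water]
CLASS_ATTRIBUTES = {
    "zebra": [1.0, 1.0, 0.0],
    "horse": [0.0, 1.0, 0.0],
    "dolphin": [0.0, 0.0, 1.0],
}

# Hypothetical stand-in for an attribute predictor trained on seen classes:
# one weight row per attribute, applied to a 3-dimensional feature vector.
WEIGHTS = [
    [4.0, 0.0, 0.0],
    [0.0, 4.0, 0.0],
    [0.0, 0.0, 4.0],
]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def predict_attributes(features, weights):
    """Step 1: map image features to attribute scores (linear layer + sigmoid)."""
    scores = []
    for row in weights:
        logit = sum(w * x for w, x in zip(row, features))
        scores.append(1.0 / (1.0 + math.exp(-logit)))
    return scores


def zero_shot_classify(features, weights, candidate_classes):
    """Step 2: choose the candidate class whose signature best matches the scores."""
    attrs = predict_attributes(features, weights)
    return max(candidate_classes, key=lambda c: cosine(attrs, CLASS_ATTRIBUTES[c]))


# "zebra" plays the role of an unseen class: it is described only by its attributes.
prediction = zero_shot_classify([1.0, 1.0, -1.0], WEIGHTS, ["zebra", "dolphin"])
print(prediction)
```

The point of the design is that the classifier never needs a training image of the unseen class; it only needs that class's attribute description, which is why attribute-annotated datasets are common in this setting.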

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

Papers

Showing 201–250 of 1864 papers

Title	Date	Tasks	Status	Hype	Score
DC3DO: Diffusion Classifier for 3D Objects	Aug 13, 2024	3D Object Classification, Classification	Code Available	1	5
Deep Learning Models for Multilingual Hate Speech Detection	Apr 14, 2020	Deep Learning, Hate Speech Detection	Code Available	1	5
A Closer Look at the Explainability of Contrastive Language-Image Pre-training	Apr 12, 2023	Interactive Segmentation, Language Modelling	Code Available	1	5
Debiased Learning from Naturally Imbalanced Pseudo-Labels	Jan 5, 2022	counterfactual, Counterfactual Reasoning	Code Available	1	5
Decoupling Zero-Shot Semantic Segmentation	Dec 15, 2021	Open Vocabulary Semantic Segmentation, Segmentation	Code Available	1	5
AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training Data	Oct 9, 2020	Attribute, Natural Questions	Code Available	1	5
A Brain Graph Foundation Model: Pre-Training and Prompt-Tuning for Any Atlas and Disorder	May 31, 2025	Contrastive Learning, Meta-Learning	Code Available	1	5
From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection	May 19, 2025	feature selection, Out-of-Distribution Generalization	Code Available	1	5
Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models	Mar 19, 2024	image-classification, Image Classification	Code Available	1	5
Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification	Feb 3, 2024	Attribute, image-classification	Code Available	1	5
Differentiable Model Scaling using Differentiable Topk	May 12, 2024	GPU, image-classification	Code Available	1	5
Knowledge-aware Zero-Shot Learning: Survey and Perspective	Feb 26, 2021	BIG-bench Machine Learning, Survey	Code Available	1	5
Differentiable Graph Module (DGM) for Graph Convolutional Networks	Feb 11, 2020	Disease Prediction, Graph Neural Network	Code Available	1	5
General Image Descriptors for Open World Image Retrieval using ViT CLIP	Oct 20, 2022	Image Retrieval, Retrieval	Code Available	1	5
Discriminative Region-based Multi-Label Zero-Shot Learning	Aug 20, 2021	Image Retrieval, Multi-label zero-shot learning	Code Available	1	5
GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning	May 28, 2023	Property Prediction, Zero-Shot Learning	Code Available	1	5
Image-free Classifier Injection for Zero-Shot Classification	Aug 21, 2023	Classification, Decoder	Code Available	1	5
Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences	Jan 27, 2022	Clinical Knowledge, Document Classification	Code Available	1	5
Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning	Jan 12, 2024	In-Context Learning, Zero-Shot Learning	Code Available	1	5
Discovering Human Interactions With Novel Objects via Zero-Shot Learning	Jun 1, 2020	Human-Object Interaction Detection, Object	Code Available	1	5
Learning Adversarial Semantic Embeddings for Zero-Shot Recognition in Open Worlds	Jul 7, 2023	Open Set Learning, Zero-Shot Learning	Code Available	1	5
Learning Attention as Disentangler for Compositional Zero-shot Learning	Mar 27, 2023	Attribute, Compositional Zero-Shot Learning	Code Available	1	5
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training	Jan 5, 2023	Contrastive Learning, Text Spotting	Code Available	1	5
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	Jan 12, 2023	3D Semantic Segmentation, Contrastive Learning	Code Available	1	5
Fine-Grained Re-Identification	Nov 26, 2020	Person Re-Identification, Zero-Shot Learning	Code Available	1	5
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability	Jul 6, 2023	Few-Shot Image Classification, Image Classification	Code Available	1	5
Florence: A New Foundation Model for Computer Vision	Nov 22, 2021	Action Classification, Action Recognition	Code Available	1	5
Domain-aware Visual Bias Eliminating for Generalized Zero-Shot Learning	Mar 30, 2020	Generalized Zero-Shot Learning, Zero-Shot Learning	Code Available	1	5
CLIPArTT: Adaptation of CLIP to New Domains at Test Time	May 1, 2024	Pseudo Label, Test-time Adaptation	Code Available	1	5
Learning to Compare: Relation Network for Few-Shot Learning	Nov 16, 2017	Few-Shot Image Classification, Few-Shot Learning	Code Available	1	5
A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level	Dec 31, 2021	Few-Shot Learning, Language Modelling	Code Available	1	5
Dual Feature Augmentation Network for Generalized Zero-shot Learning	Sep 25, 2023	Attribute, Diversity	Code Available	1	5
Zero-Shot Learning Through Cross-Modal Transfer	Jan 16, 2013	Outlier Detection, Zero-Shot Learning	Code Available	1	5
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning	Jul 4, 2022	Attribute, Contrastive Learning	Code Available	1	5
Beyond Prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Representations	Oct 29, 2022	Clustering, Sentence	Code Available	1	5
Leveraging Foundation Models for Zero-Shot IoT Sensing	Jul 29, 2024	Data Augmentation, Generalized Zero-Shot Learning	Code Available	1	5
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction	Apr 4, 2025	Attribute, Language Modeling	Code Available	1	5
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models	Jun 10, 2025	Contrastive Learning, Image-text matching	Code Available	1	5
A Simple Exponential Family Framework for Zero-Shot Learning	Jul 25, 2017	Attribute, Few-Shot Learning	Code Available	1	5
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment	Oct 2, 2024	Self-Supervised Learning, zero-shot-classification	Code Available	1	5
FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language Models	Dec 28, 2023	class-incremental learning, Class Incremental Learning	Code Available	1	5
A causal view of compositional zero-shot recognition	Jun 25, 2020	Attribute, Compositional Zero-Shot Learning	Code Available	1	5
For Overall Nighttime Visibility: Integrate Irregular Glow Removal With Glow-Aware Enhancement	Sep 23, 2024	Flare Removal, Image Enhancement	Code Available	1	5
A Shared Multi-Attention Framework for Multi-Label Zero-Shot Learning	Jun 1, 2020	Multi-label zero-shot learning, Zero-Shot Learning	Code Available	1	5
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition	Oct 25, 2023	Facial Expression Recognition, Facial Expression Recognition (FER)	Code Available	1	5
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark	Jul 15, 2021	Few-Shot Learning, Machine Reading Comprehension	Code Available	1	5
Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning	Oct 7, 2020	Representation Learning, Zero-Shot Learning	Code Available	1	5
Adversarial Illusions in Multi-Modal Embeddings	Aug 22, 2023	Image Generation, Text Generation	Code Available	1	5
CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale	May 27, 2024	Contrastive Learning, Zero-Shot Learning	Code Available	1	5
FETA: Towards Specializing Foundation Models for Expert Task Applications	Sep 8, 2022	Domain Generalization, Few-Shot Learning	Code Available	1	5

Datasets: CUB-200-2011, MedConceptsQA, SUN Attribute, AwA2, Caltech-101, CIFAR-10, CIFAR-100, COCO-MLT, DTD, FGVC-Aircraft, Flowers-102, Food-101

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	87.5	—	Unverified
2	DUET	average top-1 classification accuracy	72.3	—	Unverified
3	Composer	average top-1 classification accuracy	69.4	—	Unverified
4	HDC-ZSC-MLP	average top-1 classification accuracy	65.6	—	Unverified
5	ZSL_TF-VAEGAN	average top-1 classification accuracy	64.9	—	Unverified
6	ZLaP	Accuracy	64.3	—	Unverified
7	ZLaP*	Accuracy	64.2	—	Unverified
8	HDC-ZSC	average top-1 classification accuracy	63.8	—	Unverified
9	SPOT	average top-1 classification accuracy	62.9	—	Unverified
10	f-VAEGAN-D2	average top-1 classification accuracy	61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	dmis-lab/biobert-v1.1	Accuracy	26.15	—	Unverified
2	meta-llama/Meta-Llama-3-8B-Instruct	Accuracy	25.84	—	Unverified
3	epfl-llm/meditron-7b	Accuracy	25.75	—	Unverified
4	dmis-lab/meerkat-7b-v1.0	Accuracy	25.68	—	Unverified
5	meta-llama/Meta-Llama-3-8B-Instruct	Accuracy	25.65	—	Unverified
6	HuggingFaceH4/zephyr-7b-beta	Accuracy	25.54	—	Unverified
7	dmis-lab/biobert-v1.1	Accuracy	25.46	—	Unverified
8	epfl-llm/meditron-70b	Accuracy	25.36	—	Unverified
9	epfl-llm/meditron-70b	Accuracy	25.26	—	Unverified
10	HuggingFaceH4/zephyr-7b-beta	Accuracy	25.06	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	77.3	—	Unverified
2	SPOT (VAEGAN)	average top-1 classification accuracy	66.04	—	Unverified
3	ZSL_TF-VAEGAN	average top-1 classification accuracy	66	—	Unverified
4	f-VAEGAN	average top-1 classification accuracy	64.7	—	Unverified
5	DUET (Ours)	average top-1 classification accuracy	64.4	—	Unverified
6	LisGAN	average top-1 classification accuracy	61.7	—	Unverified
7	TCN	average top-1 classification accuracy	61.5	—	Unverified
8	f-CLSWGAN	average top-1 classification accuracy	60.8	—	Unverified
9	Cycle-WGAN	average top-1 classification accuracy	59.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	86.4	—	Unverified
2	ZSL-KG	average top-1 classification accuracy	78.08	—	Unverified
3	ZSL_TF-VAEGAN	average top-1 classification accuracy	72.2	—	Unverified
4	DUET (Ours)	average top-1 classification accuracy	69.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	84	—	Unverified
2	ZLaP*	Accuracy	83.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	93.6	—	Unverified
2	ZLaP	Accuracy	93.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	74.2	—	Unverified
2	ZLaP	Accuracy	74	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViT-B/16	Average mAP	60.17	—	Unverified
2	ResNet-50	Average mAP	56.19	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	51.2	—	Unverified
2	ZLaP*	Accuracy	51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	29.1	—	Unverified
2	ZLaP*	Accuracy	29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	75.9	—	Unverified
2	ZLaP*	Accuracy	75.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	87.9	—	Unverified
2	ZLaP	Accuracy	87.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Top 1 Accuracy	72.1	—	Unverified
2	ZLaP*	Top 1 Accuracy	72.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HiTeA	Accuracy	21.7	—	Unverified
2	HiTeA	Accuracy	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HiTeA	Accuracy	37.4	—	Unverified
2	HiTeA	Accuracy	0.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SPOT	average top-1 classification accuracy	71.9	—	Unverified
2	ZSL_TF-VAEGAN	average top-1 classification accuracy	70.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	90	—	Unverified
2	ZLaP*	Accuracy	89	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	71.8	—	Unverified
2	ZLaP	Accuracy	71.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	71.4	—	Unverified
2	ZLaP	Accuracy	71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	76.3	—	Unverified
2	ZLaP*	Accuracy	76.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CLIP(ViT-B/16)	Average mAP	85.77	—	Unverified
2	CLIP(ResNet-50)	Average mAP	84.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZSL-KG	Top-1	60.54	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	zsl_ADA	Average Per-Class Accuracy	70.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	63.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MSDA	Pearson correlation coefficient (PCC)	0.52	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SeViLA	Accuracy	72.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	M^2-Encoder	Accuracy	80.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FrozenBiLM	Accuracy	51.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CZSL	A-acc	36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZS3Net	k=10 mIOU	26.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZSL-KG	Accuracy	88.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VideoChat2	Accuracy	40.6	—	Unverified
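
Several of the tables above report "average top-1 classification accuracy". In the zero-shot literature this commonly means top-1 accuracy computed per class and then averaged over classes, so that rare classes count as much as frequent ones. Assuming that convention applies here (the page itself does not define the metric), a minimal sketch of the computation follows; the function name and the toy labels are hypothetical.

```python
from collections import defaultdict


def per_class_top1_accuracy(y_true, y_pred):
    """Mean of per-class top-1 accuracies (each class weighted equally)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    correct = defaultdict(int)  # correct predictions per true class
    total = defaultdict(int)    # number of samples per true class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        correct[truth] += int(truth == pred)

    # Average the per-class accuracies rather than pooling all samples.
    return sum(correct[c] / total[c] for c in total) / len(total)


# Toy example: class "a" is frequent and always right, class "b" is rare and wrong.
y_true = ["a", "a", "a", "a", "b"]
y_pred = ["a", "a", "a", "a", "c"]
score = per_class_top1_accuracy(y_true, y_pred)
print(score)  # per-class mean is 0.5, whereas pooled accuracy would be 0.8
```

The gap between the two numbers in the toy example is the reason a per-class average is often preferred when test classes are imbalanced.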