Zero-Shot Learning

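Zero-shot learning (ZSL) is a model's ability to recognize classes it never saw during training. The defining condition is that no labeled examples of these classes are available during supervised learning, so the model must rely on auxiliary information, such as attributes or text descriptions, to relate them to the classes it has seen.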

Earlier work in zero-shot learning used attributes in a two-step approach to infer unknown classes. In computer vision, more recent work learns mappings from the image feature space to a semantic space, and other approaches learn non-linear multimodal embeddings. In modern NLP, language models can be evaluated on downstream tasks without fine-tuning.
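
The feature-to-semantic mapping idea can be illustrated with a minimal sketch. It assumes a plain ridge-regression mapping and cosine-similarity matching, which is one simple instance of the approach and not the method of any particular paper listed here. The class names, attribute vectors, and synthetic features are all hypothetical.

```python
import numpy as np

def fit_feature_to_semantic(X, S, reg=1.0):
    """Ridge regression: learn W so that X @ W approximates S.

    X: (n, d) features of seen-class samples.
    S: (n, k) semantic (attribute) vector of each sample's class.
    """
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + reg * np.eye(d), X.T @ S)

def predict_unseen(X, W, class_semantics):
    """Label each row of X with the most cosine-similar unseen class."""
    names = list(class_semantics)
    P = np.stack([class_semantics[n] for n in names])   # (c, k)
    Z = X @ W                                           # project into semantic space
    Z = Z / (np.linalg.norm(Z, axis=1, keepdims=True) + 1e-12)
    P = P / (np.linalg.norm(P, axis=1, keepdims=True) + 1e-12)
    return [names[i] for i in (Z @ P.T).argmax(axis=1)]

# Hypothetical attributes: [has_stripes, has_hooves, has_wings]
seen = {"tiger": [1, 0, 0], "horse": [0, 1, 0], "eagle": [0, 0, 1]}
unseen = {"zebra": np.array([1.0, 1.0, 0.0]),
          "pegasus": np.array([0.0, 1.0, 1.0])}

# Synthetic stand-in for image features: a fixed linear function of the
# attributes plus noise (an assumption made only to keep the demo small).
rng = np.random.default_rng(0)
A = rng.normal(size=(3, 16))
S_train = np.repeat(np.array(list(seen.values()), dtype=float), 100, axis=0)
X_train = S_train @ A + 0.05 * rng.normal(size=(300, 16))

W = fit_feature_to_semantic(X_train, S_train)

# Query features come from a class that never appeared in training.
X_query = unseen["zebra"] @ A + 0.05 * rng.normal(size=(5, 16))
print(predict_unseen(X_query, W, unseen))
```

Training only ever sees tiger, horse, and eagle samples. The unseen classes enter solely through their attribute vectors at prediction time, which is what makes the setup zero-shot.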

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

(Image credit: Prototypical Networks for Few shot Learning in PyTorch)

Papers


Showing 251–300 of 1864 papers

Title	Date	Tasks	Status	Hype
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIP	Jan 12, 2023	3D Semantic Segmentation, Contrastive Learning	Code Available	1
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training	Jan 5, 2023	Contrastive Learning, Text Spotting	Code Available	1
Zero-shot Triplet Extraction by Template Infilling	Dec 21, 2022	Data Augmentation, Language Modeling	Code Available	1
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning	Dec 21, 2022	Sensitivity, Transfer Learning	Code Available	1
On Improving Summarization Factual Consistency from Natural Language Feedback	Dec 20, 2022	Text Generation, Zero-Shot Learning	Code Available	1
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting	Dec 19, 2022	Language Modelling, Zero-Shot Learning	Code Available	1
Attentive Mask CLIP	Dec 16, 2022	Contrastive Learning, Retrieval	Code Available	1
Reproducible scaling laws for contrastive language-image learning	Dec 14, 2022	Image Classification, Open Vocabulary Attribute Detection	Code Available	1
LidarCLIP or: How I Learned to Talk to Point Clouds	Dec 13, 2022	Image Generation, Retrieval	Code Available	1
Resolving Semantic Confusions for Improved Zero-Shot Detection	Dec 12, 2022	Generalized Zero-Shot Object Detection, Object Detection	Code Available	1
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning	Dec 9, 2022	Contrastive Learning, image-classification	Code Available	1
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion	Dec 7, 2022	Data Augmentation, Instance Segmentation	Code Available	1
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models	Nov 28, 2022	Retrieval, zero-shot-classification	Code Available	1
Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning	Nov 19, 2022	Compositional Zero-Shot Learning, Novel Concepts	Code Available	1
AdaptKeyBERT: An Attention-Based approach towards Few-Shot & Zero-Shot Domain Adaptation of KeyBERT	Nov 14, 2022	Domain Adaptation, Fact Verification	Code Available	1
Hyperparameter optimization in deep multi-target prediction	Nov 8, 2022	AutoML, Benchmarking	Code Available	1
Beyond Prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Representations	Oct 29, 2022	Clustering, Sentence	Code Available	1
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless Training	Oct 28, 2022	Language Modelling, Motion Generation	Code Available	1
ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback	Oct 22, 2022	Data-free Knowledge Distillation, Dataset Generation	Code Available	1
General Image Descriptors for Open World Image Retrieval using ViT CLIP	Oct 20, 2022	Image Retrieval, Retrieval	Code Available	1
Meta-Learning via Classifier(-free) Diffusion Guidance	Oct 17, 2022	Few-Shot Learning, Image Generation	Code Available	1
Improving Object-centric Learning with Query Optimization	Oct 17, 2022	Image Segmentation, Object	Code Available	1
Visual Classification via Description from Large Language Models	Oct 13, 2022	Classification, Descriptive	Code Available	1
LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language Models	Oct 3, 2022	Few-Shot Learning, Language Modelling	Code Available	1
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention	Sep 28, 2022	Training-free 3D Point Cloud Classification, Transfer Learning	Code Available	1
Natural Language Inference Prompts for Zero-shot Emotion Classification in Text across Corpora	Sep 14, 2022	Emotion Classification, Natural Language Inference	Code Available	1
FETA: Towards Specializing Foundation Models for Expert Task Applications	Sep 8, 2022	Domain Generalization, Few-Shot Learning	Code Available	1
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks	Aug 8, 2022	Image Generation, Text to Image Generation	Code Available	1
Temporal and cross-modal attention for audio-visual zero-shot learning	Jul 20, 2022	GZSL Video Classification, Video Classification	Code Available	1
Contributions of Shape, Texture, and Color in Visual Recognition	Jul 19, 2022	Attribute, General Classification	Code Available	1
A Personalized Zero-Shot ECG Arrhythmia Monitoring System: From Sparse Representation Based Domain Adaption to Energy Efficient Abnormal Beat Detection for Practical ECG Surveillance	Jul 14, 2022	Arrhythmia Detection, Dictionary Learning	Code Available	1
Boosting Zero-shot Learning via Contrastive Optimization of Attribute Representations	Jul 8, 2022	Attribute, Zero-Shot Learning	Code Available	1
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer	Jul 5, 2022	Image-text matching, Knowledge Distillation	Code Available	1
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning	Jul 4, 2022	Attribute, Contrastive Learning	Code Available	1
Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning	Jun 29, 2022	Compositional Zero-Shot Learning, Diversity	Code Available	1
ProtoCLIP: Prototypical Contrastive Language Image Pretraining	Jun 22, 2022	zero-shot-classification, Zero-Shot Learning	Code Available	1
Zero-Shot Video Question Answering via Frozen Bidirectional Language Models	Jun 16, 2022	Fill Mask, Language Modeling	Code Available	1
Zero-shot object goal visual navigation	Jun 15, 2022	Knowledge Graphs, Object	Code Available	1
Disentangled Ontology Embedding for Zero-shot Learning	Jun 8, 2022	image-classification, Image Classification	Code Available	1
Prompt Injection: Parameterization of Fixed Inputs	May 31, 2022	Semantic Parsing, Zero-Shot Learning	Code Available	1
CyCLIP: Cyclic Contrastive Language-Image Pretraining	May 28, 2022	Representation Learning, Visual Reasoning	Code Available	1
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning	May 25, 2022	text-classification, Text Classification	Code Available	1
Disentangling Visual Embeddings for Attributes and Objects	May 17, 2022	Attribute, Compositional Zero-Shot Learning	Code Available	1
KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot Learning	May 13, 2022	Compositional Zero-Shot Learning, Missing Labels	Code Available	1
Learning to Answer Visual Questions from Web Videos	May 10, 2022	Dataset Generation, Question Answering	Code Available	1
Zero-shot Learning for Grapheme to Phoneme Conversion with Language Ensemble	May 1, 2022	8k, Grapheme-to-Phoneme Conversion	Code Available	1
Learn to Adapt for Generalized Zero-Shot Text Classification	May 1, 2022	Classification, Generalized Zero-Shot Learning	Code Available	1
Zero-Shot Logit Adjustment	Apr 25, 2022	Bayesian Inference, Generalized Zero-Shot Learning	Code Available	1
No Token Left Behind: Explainability-Aided Image Classification and Generation	Apr 11, 2022	image-classification, Image Classification	Code Available	1
Learning to Compose Soft Prompts for Compositional Zero-Shot Learning	Apr 7, 2022	Attribute, Compositional Zero-Shot Learning	Code Available	1



Datasets: CUB-200-2011, MedConceptsQA, SUN Attribute, AwA2, Caltech-101, CIFAR-10, CIFAR-100, COCO-MLT, DTD, FGVC-Aircraft, Flowers-102, Food-101

Benchmark Results
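
Several of the tables in this section report "average top-1 classification accuracy". The page does not define the metric. In zero-shot benchmarks it is commonly computed as the mean of per-class top-1 accuracies, so that rare classes count as much as frequent ones. The sketch below assumes that convention, and the labels in the example are hypothetical.

```python
from collections import defaultdict

def per_class_top1_accuracy(y_true, y_pred):
    """Mean of per-class top-1 accuracies (assumed reading of the metric)."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)
    # Average the per-class rates, giving every class equal weight.
    return sum(correct[c] / total[c] for c in total) / len(total)

# Hypothetical labels: 8 zebra samples (all correct), 2 okapi samples (1 correct).
y_true = ["zebra"] * 8 + ["okapi"] * 2
y_pred = ["zebra"] * 8 + ["okapi", "zebra"]

print(per_class_top1_accuracy(y_true, y_pred))   # (1.0 + 0.5) / 2 = 0.75
```

Plain sample-level accuracy on the same predictions would be 9/10 = 0.9, which shows how the two conventions diverge when class sizes are imbalanced.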

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	87.5	—	Unverified
2	DUET	average top-1 classification accuracy	72.3	—	Unverified
3	Composer	average top-1 classification accuracy	69.4	—	Unverified
4	HDC-ZSC-MLP	average top-1 classification accuracy	65.6	—	Unverified
5	ZSL_TF-VAEGAN	average top-1 classification accuracy	64.9	—	Unverified
6	ZLaP	Accuracy	64.3	—	Unverified
7	ZLaP*	Accuracy	64.2	—	Unverified
8	HDC-ZSC	average top-1 classification accuracy	63.8	—	Unverified
9	SPOT	average top-1 classification accuracy	62.9	—	Unverified
10	f-VAEGAN-D2	average top-1 classification accuracy	61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	dmis-lab/biobert-v1.1	Accuracy	26.15	—	Unverified
2	meta-llama/Meta-Llama-3-8B-Instruct	Accuracy	25.84	—	Unverified
3	epfl-llm/meditron-7b	Accuracy	25.75	—	Unverified
4	dmis-lab/meerkat-7b-v1.0	Accuracy	25.68	—	Unverified
5	meta-llama/Meta-Llama-3-8B-Instruct	Accuracy	25.65	—	Unverified
6	HuggingFaceH4/zephyr-7b-beta	Accuracy	25.54	—	Unverified
7	dmis-lab/biobert-v1.1	Accuracy	25.46	—	Unverified
8	epfl-llm/meditron-70b	Accuracy	25.36	—	Unverified
9	epfl-llm/meditron-70b	Accuracy	25.26	—	Unverified
10	HuggingFaceH4/zephyr-7b-beta	Accuracy	25.06	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	77.3	—	Unverified
2	SPOT (VAEGAN)	average top-1 classification accuracy	66.04	—	Unverified
3	ZSL_TF-VAEGAN	average top-1 classification accuracy	66	—	Unverified
4	f-VAEGAN	average top-1 classification accuracy	64.7	—	Unverified
5	DUET (Ours)	average top-1 classification accuracy	64.4	—	Unverified
6	LisGAN	average top-1 classification accuracy	61.7	—	Unverified
7	TCN	average top-1 classification accuracy	61.5	—	Unverified
8	f-CLSWGAN	average top-1 classification accuracy	60.8	—	Unverified
9	Cycle-WGAN	average top-1 classification accuracy	59.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	86.4	—	Unverified
2	ZSL-KG	average top-1 classification accuracy	78.08	—	Unverified
3	ZSL_TF-VAEGAN	average top-1 classification accuracy	72.2	—	Unverified
4	DUET (Ours)	average top-1 classification accuracy	69.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	84	—	Unverified
2	ZLaP*	Accuracy	83.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	93.6	—	Unverified
2	ZLaP	Accuracy	93.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	74.2	—	Unverified
2	ZLaP	Accuracy	74	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViT-B/16	Average mAP	60.17	—	Unverified
2	ResNet-50	Average mAP	56.19	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	51.2	—	Unverified
2	ZLaP*	Accuracy	51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	29.1	—	Unverified
2	ZLaP*	Accuracy	29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	75.9	—	Unverified
2	ZLaP*	Accuracy	75.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	87.9	—	Unverified
2	ZLaP	Accuracy	87.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Top 1 Accuracy	72.1	—	Unverified
2	ZLaP*	Top 1 Accuracy	72.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HiTeA	Accuracy	21.7	—	Unverified
2	HiTeA	Accuracy	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HiTeA	Accuracy	37.4	—	Unverified
2	HiTeA	Accuracy	0.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SPOT	average top-1 classification accuracy	71.9	—	Unverified
2	ZSL_TF-VAEGAN	average top-1 classification accuracy	70.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	90	—	Unverified
2	ZLaP*	Accuracy	89	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	71.8	—	Unverified
2	ZLaP	Accuracy	71.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	71.4	—	Unverified
2	ZLaP	Accuracy	71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	76.3	—	Unverified
2	ZLaP*	Accuracy	76.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CLIP(ViT-B/16)	Average mAP	85.77	—	Unverified
2	CLIP(ResNet-50)	Average mAP	84.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZSL-KG	Top-1	60.54	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	zsl_ADA	Average Per-Class Accuracy	70.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	63.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MSDA	Pearson correlation coefficient (PCC)	0.52	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SeViLA	Accuracy	72.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	M^2-Encoder	Accuracy	80.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FrozenBiLM	Accuracy	51.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CZSL	A-acc	36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZS3Net	k=10 mIOU	26.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZSL-KG	Accuracy	88.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VideoChat2	Accuracy	40.6	—	Unverified