Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 1864 papers

Title	Date	Tasks	Status	Hype
Compound Expression Recognition via Multi Model Ensemble for the ABAW7 Challenge	Jul 17, 2024	Ensemble LearningZero-Shot Learning	—Unverified	0
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains	Jul 16, 2024	Decision MakingLanguage Modeling	CodeCode Available	1
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts	Jul 15, 2024	Zero-Shot Learning	—Unverified	0
Anticipating Future Object Compositions without Forgetting	Jul 15, 2024	AttributeCompositional Zero-Shot Learning	—Unverified	0
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding	Jul 13, 2024	Scene UnderstandingZero-Shot Learning	—Unverified	0
PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models	Jul 13, 2024	Language ModelingLanguage Modelling	—Unverified	0
STD-PLM: Understanding Both Spatial and Temporal Properties of Spatial-Temporal Data with PLM	Jul 12, 2024	Few-Shot LearningImputation	CodeCode Available	1
Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning	Jul 11, 2024	Temporal SequencesZero-Shot Learning	—Unverified	0
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging	Jul 10, 2024	Contrastive LearningImage-text Retrieval	—Unverified	0
DuInNet: Dual-Modality Feature Interaction for Point Cloud Completion	Jul 10, 2024	DenoisingPoint Cloud Completion	—Unverified	0
Malicious Path Manipulations via Exploitation of Representation Vulnerabilities of Vision-Language Navigation Systems	Jul 10, 2024	Language ModelingLanguage Modelling	—Unverified	0
Towards a text-based quantitative and explainable histopathology image analysis	Jul 10, 2024	image-classificationImage Classification	CodeCode Available	0
Pseudo-triplet Guided Few-shot Composed Image Retrieval	Jul 8, 2024	Active LearningImage Retrieval	—Unverified	0
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models	Jul 1, 2024	BenchmarkingFairness	CodeCode Available	2
Semantic Compositions Enhance Vision-Language Contrastive Learning	Jul 1, 2024	ClassificationContrastive Learning	—Unverified	0
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP	Jun 25, 2024	cross-modal alignmentImage Classification	CodeCode Available	2
BioTrove: A Large Curated Image Dataset Enabling AI for Biodiversity	Jun 25, 2024	Zero-Shot Learning	CodeCode Available	1
At First Sight: Zero-Shot Classification of Astronomical Images with Large Multimodal Models	Jun 24, 2024	AstronomyClassification	—Unverified	0
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings	Jun 24, 2024	Conditional Text GenerationLanguage Modelling	CodeCode Available	0
Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain	Jun 23, 2024	Few-Shot Learningobject-detection	—Unverified	0
Serial Position Effects of Large Language Models	Jun 23, 2024	PositionZero-Shot Learning	—Unverified	0
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation	Jun 23, 2024	Representation Learningzero-shot-classification	—Unverified	0
Contextual Interaction via Primitive-based Adversarial Training For Compositional Zero-shot Learning	Jun 21, 2024	AttributeCompositional Zero-Shot Learning	CodeCode Available	0
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation	Jun 21, 2024	ClassificationDecoder	CodeCode Available	0
Factual Dialogue Summarization via Learning from Large Language Models	Jun 20, 2024	Contrastive LearningData Augmentation	—Unverified	0
A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning	Jun 20, 2024	DiagnosticImage to text	CodeCode Available	0
Using Multimodal Large Language Models for Automated Detection of Traffic Safety Critical Events	Jun 19, 2024	Few-Shot LearningZero-Shot Learning	—Unverified	0
Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition	Jun 19, 2024	Action RecognitionSkeleton Based Action Recognition	CodeCode Available	1
FuseGen: PLM Fusion for Data-generation based Zero-shot Learning	Jun 18, 2024	Zero-Shot Learning	CodeCode Available	0
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning	Jun 18, 2024	AttributeCompositional Zero-Shot Learning	—Unverified	0
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity	Jun 18, 2024	Contrastive LearningLanguage Modelling	CodeCode Available	1
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM	Jun 17, 2024	Continual Pretrainingzero-shot-classification	—Unverified	0
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments	Jun 17, 2024	FairnessLanguage Modeling	CodeCode Available	1
Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition	Jun 13, 2024	Retrievalzero-shot-classification	CodeCode Available	1
Zero-Shot Learning Over Large Output Spaces : Utilizing Indirect Knowledge Extraction from Large Language Models	Jun 13, 2024	Language ModellingLarge Language Model	—Unverified	0
RWKV-CLIP: A Robust Vision-Language Representation Learner	Jun 11, 2024	Image-text RetrievalRepresentation Learning	CodeCode Available	2
Understanding Visual Concepts Across Models	Jun 11, 2024	Image Generationobject-detection	CodeCode Available	0
BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense	Jun 7, 2024	Common Sense ReasoningSentence	CodeCode Available	0
CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment	Jun 7, 2024	Contrastive LearningZero-Shot Learning	CodeCode Available	1
CountCLIP -- [Re] Teaching CLIP to Count to Ten	Jun 5, 2024	zero-shot-classificationZero-Shot Counting	CodeCode Available	1
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning	Jun 5, 2024	AttributeDomain Generalization	—Unverified	0
Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models	Jun 5, 2024	Generalized Zero-Shot LearningZero-Shot Learning	—Unverified	0
Description Boosting for Zero-Shot Entity and Relation Classification	Jun 4, 2024	RelationRelation Classification	CodeCode Available	3
SLANT: Spurious Logo ANalysis Toolkit	Jun 3, 2024	zero-shot-classificationZero-Shot Learning	—Unverified	0
Multi-Modal Generative Embedding Model	May 29, 2024	Caption GenerationCross-Modal Retrieval	—Unverified	0
It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap	May 28, 2024	image-classificationImage Classification	—Unverified	0
MM-Mixing: Multi-Modal Mixing Alignment for 3D Understanding	May 28, 2024	3D Classification3D Object Recognition	—Unverified	0
CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale	May 27, 2024	Contrastive LearningZero-Shot Learning	CodeCode Available	1
Listenable Maps for Zero-Shot Audio Classifiers	May 27, 2024	Decoderzero-shot-classification	—Unverified	0
TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection	May 27, 2024	Few-Shot LearningLanguage Modeling	CodeCode Available	0

Show:10 25 50

← PrevPage 8 of 38Next →

All datasets CUB-200-2011 MedConceptsQA SUN Attribute AwA2 Caltech-101 CIFAR-10 CIFAR-100 COCO-MLT DTD FGVC-Aircraft Flowers-102 Food-101

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	87.5	—	Unverified
2	DUET	average top-1 classification accuracy	72.3	—	Unverified
3	Composer	average top-1 classification accuracy	69.4	—	Unverified
4	HDC-ZSC-MLP	average top-1 classification accuracy	65.6	—	Unverified
5	ZSL_TF-VAEGAN	average top-1 classification accuracy	64.9	—	Unverified
6	ZLaP	Accuracy	64.3	—	Unverified
7	ZLaP*	Accuracy	64.2	—	Unverified
8	HDC-ZSC	average top-1 classification accuracy	63.8	—	Unverified
9	SPOT	average top-1 classification accuracy	62.9	—	Unverified
10	f-VAEGAN-D2	average top-1 classification accuracy	61	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	dmis-lab/biobert-v1.1	Accuracy	26.15	—	Unverified
2	meta-llama/Meta-Llama-3-8B-Instruct	Accuracy	25.84	—	Unverified
3	epfl-llm/meditron-7b	Accuracy	25.75	—	Unverified
4	dmis-lab/meerkat-7b-v1.0	Accuracy	25.68	—	Unverified
5	meta-llama/Meta-Llama-3-8B-Instruct	Accuracy	25.65	—	Unverified
6	HuggingFaceH4/zephyr-7b-beta	Accuracy	25.54	—	Unverified
7	dmis-lab/biobert-v1.1	Accuracy	25.46	—	Unverified
8	epfl-llm/meditron-70b	Accuracy	25.36	—	Unverified
9	epfl-llm/meditron-70b	Accuracy	25.26	—	Unverified
10	HuggingFaceH4/zephyr-7b-beta	Accuracy	25.06	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	77.3	—	Unverified
2	SPOT (VAEGAN)	average top-1 classification accuracy	66.04	—	Unverified
3	ZSL_TF-VAEGAN	average top-1 classification accuracy	66	—	Unverified
4	f-VAEGAN	average top-1 classification accuracy	64.7	—	Unverified
5	DUET (Ours)	average top-1 classification accuracy	64.4	—	Unverified
6	LisGAN	average top-1 classification accuracy	61.7	—	Unverified
7	TCN	average top-1 classification accuracy	61.5	—	Unverified
8	f-CLSWGAN	average top-1 classification accuracy	60.8	—	Unverified
9	Cycle-WGAN	average top-1 classification accuracy	59.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZeroDiff	average top-1 classification accuracy	86.4	—	Unverified
2	ZSL-KG	average top-1 classification accuracy	78.08	—	Unverified
3	ZSL_TF-VAEGAN	average top-1 classification accuracy	72.2	—	Unverified
4	DUET (Ours)	average top-1 classification accuracy	69.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	84	—	Unverified
2	ZLaP*	Accuracy	83.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	93.6	—	Unverified
2	ZLaP	Accuracy	93.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	74.2	—	Unverified
2	ZLaP	Accuracy	74	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViT-B/16	Average mAP	60.17	—	Unverified
2	ResNet-50	Average mAP	56.19	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	51.2	—	Unverified
2	ZLaP*	Accuracy	51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	29.1	—	Unverified
2	ZLaP*	Accuracy	29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	75.9	—	Unverified
2	ZLaP*	Accuracy	75.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	87.9	—	Unverified
2	ZLaP	Accuracy	87.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Top 1 Accuracy	72.1	—	Unverified
2	ZLaP*	Top 1 Accuracy	72.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HiTeA	Accuracy	21.7	—	Unverified
2	HiTeA	Accuracy	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HiTeA	Accuracy	37.4	—	Unverified
2	HiTeA	Accuracy	0.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SPOT	average top-1 classification accuracy	71.9	—	Unverified
2	ZSL_TF-VAEGAN	average top-1 classification accuracy	70.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP	Accuracy	90	—	Unverified
2	ZLaP*	Accuracy	89	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	71.8	—	Unverified
2	ZLaP	Accuracy	71.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	71.4	—	Unverified
2	ZLaP	Accuracy	71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	76.3	—	Unverified
2	ZLaP	Accuracy	76.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CLIP(ViT-B/16)	Average mAP	85.77	—	Unverified
2	CLIP(ResNet-50)	Average mAP	84.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZSL-KG	Top-1	60.54	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	zsl_ADA	Average Per-Class Accuracy	70.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZLaP*	Accuracy	63.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MSDA	Pearson correlation coefficient (PCC)	0.52	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SeViLA	Accuracy	72.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	M^2-Encoder	Accuracy	80.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FrozenBiLM	Accuracy	51.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CZSL	A-acc	36	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZS3Net	k=10 mIOU	26.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ZSL-KG	Accuracy	88.98	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	VideoChat2	Accuracy	40.6	—	Unverified