SOTAVerified

Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Further readings:

Papers

Showing 251300 of 1864 papers

TitleStatusHype
CLIP2Scene: Towards Label-efficient 3D Scene Understanding by CLIPCode1
Filtering, Distillation, and Hard Negatives for Vision-Language Pre-TrainingCode1
Zero-shot Triplet Extraction by Template InfillingCode1
MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction TuningCode1
On Improving Summarization Factual Consistency from Natural Language FeedbackCode1
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot PromptingCode1
Attentive Mask CLIPCode1
Reproducible scaling laws for contrastive language-image learningCode1
LidarCLIP or: How I Learned to Talk to Point CloudsCode1
Resolving Semantic Confusions for Improved Zero-Shot DetectionCode1
Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive LearningCode1
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusionCode1
SuS-X: Training-Free Name-Only Transfer of Vision-Language ModelsCode1
Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot LearningCode1
AdaptKeyBERT: An Attention-Based approach towards Few-Shot & Zero-Shot Domain Adaptation of KeyBERTCode1
Hyperparameter optimization in deep multi-target predictionCode1
Beyond Prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering RepresentationsCode1
Being Comes from Not-being: Open-vocabulary Text-to-Motion Generation with Wordless TrainingCode1
ProGen: Progressive Zero-shot Dataset Generation via In-context FeedbackCode1
General Image Descriptors for Open World Image Retrieval using ViT CLIPCode1
Meta-Learning via Classifier(-free) Diffusion GuidanceCode1
Improving Object-centric Learning with Query OptimizationCode1
Visual Classification via Description from Large Language ModelsCode1
LASP: Text-to-Text Optimization for Language-Aware Soft Prompting of Vision & Language ModelsCode1
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free AttentionCode1
Natural Language Inference Prompts for Zero-shot Emotion Classification in Text across CorporaCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield NetworksCode1
Temporal and cross-modal attention for audio-visual zero-shot learningCode1
Contributions of Shape, Texture, and Color in Visual RecognitionCode1
A Personalized Zero-Shot ECG Arrhythmia Monitoring System: From Sparse Representation Based Domain Adaption to Energy Efficient Abnormal Beat Detection for Practical ECG SurveillanceCode1
Boosting Zero-shot Learning via Contrastive Optimization of Attribute RepresentationsCode1
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge TransferCode1
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot LearningCode1
Siamese Contrastive Embedding Network for Compositional Zero-Shot LearningCode1
ProtoCLIP: Prototypical Contrastive Language Image PretrainingCode1
Zero-Shot Video Question Answering via Frozen Bidirectional Language ModelsCode1
Zero-shot object goal visual navigationCode1
Disentangled Ontology Embedding for Zero-shot LearningCode1
Prompt Injection: Parameterization of Fixed InputsCode1
CyCLIP: Cyclic Contrastive Language-Image PretrainingCode1
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot LearningCode1
Disentangling Visual Embeddings for Attributes and ObjectsCode1
KG-SP: Knowledge Guided Simple Primitives for Open World Compositional Zero-Shot LearningCode1
Learning to Answer Visual Questions from Web VideosCode1
Zero-shot Learning for Grapheme to Phoneme Conversion with Language EnsembleCode1
Learn to Adapt for Generalized Zero-Shot Text ClassificationCode1
Zero-Shot Logit AdjustmentCode1
No Token Left Behind: Explainability-Aided Image Classification and GenerationCode1
Learning to Compose Soft Prompts for Compositional Zero-Shot LearningCode1
Show:102550
← PrevPage 6 of 38Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy87.5Unverified
2DUETaverage top-1 classification accuracy72.3Unverified
3Composeraverage top-1 classification accuracy69.4Unverified
4HDC-ZSC-MLPaverage top-1 classification accuracy65.6Unverified
5ZSL_TF-VAEGANaverage top-1 classification accuracy64.9Unverified
6ZLaPAccuracy64.3Unverified
7ZLaP*Accuracy64.2Unverified
8HDC-ZSCaverage top-1 classification accuracy63.8Unverified
9SPOTaverage top-1 classification accuracy62.9Unverified
10f-VAEGAN-D2average top-1 classification accuracy61Unverified
#ModelMetricClaimedVerifiedStatus
1dmis-lab/biobert-v1.1Accuracy26.15Unverified
2meta-llama/Meta-Llama-3-8B-InstructAccuracy25.84Unverified
3epfl-llm/meditron-7bAccuracy25.75Unverified
4dmis-lab/meerkat-7b-v1.0Accuracy25.68Unverified
5meta-llama/Meta-Llama-3-8B-InstructAccuracy25.65Unverified
6HuggingFaceH4/zephyr-7b-betaAccuracy25.54Unverified
7dmis-lab/biobert-v1.1Accuracy25.46Unverified
8epfl-llm/meditron-70bAccuracy25.36Unverified
9epfl-llm/meditron-70bAccuracy25.26Unverified
10HuggingFaceH4/zephyr-7b-betaAccuracy25.06Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy77.3Unverified
2SPOT (VAEGAN)average top-1 classification accuracy66.04Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy66Unverified
4f-VAEGANaverage top-1 classification accuracy64.7Unverified
5DUET (Ours)average top-1 classification accuracy64.4Unverified
6LisGANaverage top-1 classification accuracy61.7Unverified
7TCNaverage top-1 classification accuracy61.5Unverified
8f-CLSWGANaverage top-1 classification accuracy60.8Unverified
9Cycle-WGANaverage top-1 classification accuracy59.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy86.4Unverified
2ZSL-KGaverage top-1 classification accuracy78.08Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy72.2Unverified
4DUET (Ours)average top-1 classification accuracy69.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy84Unverified
2ZLaP*Accuracy83.1Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy93.6Unverified
2ZLaPAccuracy93.4Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy74.2Unverified
2ZLaPAccuracy74Unverified
#ModelMetricClaimedVerifiedStatus
1ViT-B/16Average mAP60.17Unverified
2ResNet-50Average mAP56.19Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy51.2Unverified
2ZLaP*Accuracy51Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy29.1Unverified
2ZLaP*Accuracy29Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy75.9Unverified
2ZLaP*Accuracy75.5Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy87.9Unverified
2ZLaPAccuracy87.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPTop 1 Accuracy72.1Unverified
2ZLaP*Top 1 Accuracy72.1Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy21.7Unverified
2HiTeAAccuracy0.46Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy37.4Unverified
2HiTeAAccuracy0.56Unverified
#ModelMetricClaimedVerifiedStatus
1SPOTaverage top-1 classification accuracy71.9Unverified
2ZSL_TF-VAEGANaverage top-1 classification accuracy70.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy90Unverified
2ZLaP*Accuracy89Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.8Unverified
2ZLaPAccuracy71.2Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.4Unverified
2ZLaPAccuracy71Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy76.3Unverified
2ZLaPAccuracy76.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP(ViT-B/16)Average mAP85.77Unverified
2CLIP(ResNet-50)Average mAP84.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGTop-160.54Unverified
#ModelMetricClaimedVerifiedStatus
1zsl_ADAAverage Per-Class Accuracy70.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy63.2Unverified
#ModelMetricClaimedVerifiedStatus
1MSDAPearson correlation coefficient (PCC)0.52Unverified
#ModelMetricClaimedVerifiedStatus
1SeViLAAccuracy72.3Unverified
#ModelMetricClaimedVerifiedStatus
1M^2-EncoderAccuracy80.7Unverified
#ModelMetricClaimedVerifiedStatus
1FrozenBiLMAccuracy51.5Unverified
#ModelMetricClaimedVerifiedStatus
1CZSLA-acc36Unverified
#ModelMetricClaimedVerifiedStatus
1ZS3Netk=10 mIOU26.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGAccuracy88.98Unverified
#ModelMetricClaimedVerifiedStatus
1VideoChat2Accuracy40.6Unverified