SOTAVerified

Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Further readings:

Papers

Showing 551600 of 1864 papers

TitleStatusHype
FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing0
Duplex: Dual Prototype Learning for Compositional Zero-Shot Learning0
Generate, Transduct, Adapt: Iterative Transduction with VLMs0
Explore Activation Sparsity in Recurrent LLMs for Energy-Efficient Neuromorphic ComputingCode0
Hidden Entity Detection from GitHub Leveraging Large Language ModelsCode0
Towards a scalable AI-driven framework for data-independent Cyber Threat Intelligence Information Extraction0
A Statistical Theory of Contrastive Pre-training and Multimodal Generative AICode0
Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis0
Advanced Machine Learning Techniques for Social Support Detection on Social Media0
Gaussian Masked Autoencoders0
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries0
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation0
Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning0
LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning0
Cross-Modal 3D Representation with Multi-View Images and Point Clouds0
Open-World Objectness Modeling Unifies Novel Object Detection0
LLM-MedQA: Enhancing Medical Question Answering through Case Studies in Large Language Models0
TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting0
Improved Feature Generating Framework for Transductive Zero-shot Learning0
Discriminative Image Generation with Diffusion Models for Zero-Shot Learning0
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio0
Towards Graph Foundation Models: Learning Generalities Across Graphs via Task-TreesCode0
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language AlignmentCode0
Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization0
Adaptive Pruning for Large Language Models with Structural Importance Awareness0
Real Classification by Description: Extending CLIP's Limits of Part Attributes RecognitionCode0
Zero-Shot Image Moderation in Google Ads with LLM-Assisted Textual Descriptions and Cross-modal Co-embeddings0
CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels0
A Simple and Efficient Baseline for Zero-Shot Generative Classification0
Enabling Low-Resource Language Retrieval: Establishing Baselines for Urdu MS MARCOCode0
Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning0
Advancing Comprehensive Aesthetic Insight with Multi-Scale Text-Guided Self-Supervised Learning0
An Efficient Framework for Enhancing Discriminative Models via Diffusion TechniquesCode0
Zero-Shot Mono-to-Binaural Speech Synthesis0
Assessing Personalized AI Mentoring with Large Language Models in the Computing Field0
Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?Code0
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning0
Retaining and Enhancing Pre-trained Knowledge in Vision-Language Models with Prompt Ensembling0
Compositional Zero-Shot Learning with Contextualized Cues and Adaptive Contrastive Training0
Privacy-Preserving Customer Support: A Framework for Secure and Scalable Interactions0
DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction0
JAPAGEN: Efficient Few/Zero-shot Learning via Japanese Training Dataset Generation with LLMCode0
S^3: Synonymous Semantic Space for Improving Zero-Shot Generalization of Vision-Language Models0
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep LearningCode0
Unified Framework for Open-World Compositional Zero-shot LearningCode0
Diffusion in Zero-Shot Learning for Environmental AudioCode0
Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention NetworksCode0
Optimizing Large Language Models for Turkish: New Methodologies in Corpus Selection and Training0
Enhancing Robustness of CLIP to Common Corruptions through Bimodal Test-Time Adaptation0
The use of large language models to enhance cancer clinical trial educational materials0
Show:102550
← PrevPage 12 of 38Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy87.5Unverified
2DUETaverage top-1 classification accuracy72.3Unverified
3Composeraverage top-1 classification accuracy69.4Unverified
4HDC-ZSC-MLPaverage top-1 classification accuracy65.6Unverified
5ZSL_TF-VAEGANaverage top-1 classification accuracy64.9Unverified
6ZLaPAccuracy64.3Unverified
7ZLaP*Accuracy64.2Unverified
8HDC-ZSCaverage top-1 classification accuracy63.8Unverified
9SPOTaverage top-1 classification accuracy62.9Unverified
10f-VAEGAN-D2average top-1 classification accuracy61Unverified
#ModelMetricClaimedVerifiedStatus
1dmis-lab/biobert-v1.1Accuracy26.15Unverified
2meta-llama/Meta-Llama-3-8B-InstructAccuracy25.84Unverified
3epfl-llm/meditron-7bAccuracy25.75Unverified
4dmis-lab/meerkat-7b-v1.0Accuracy25.68Unverified
5meta-llama/Meta-Llama-3-8B-InstructAccuracy25.65Unverified
6HuggingFaceH4/zephyr-7b-betaAccuracy25.54Unverified
7dmis-lab/biobert-v1.1Accuracy25.46Unverified
8epfl-llm/meditron-70bAccuracy25.36Unverified
9epfl-llm/meditron-70bAccuracy25.26Unverified
10HuggingFaceH4/zephyr-7b-betaAccuracy25.06Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy77.3Unverified
2SPOT (VAEGAN)average top-1 classification accuracy66.04Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy66Unverified
4f-VAEGANaverage top-1 classification accuracy64.7Unverified
5DUET (Ours)average top-1 classification accuracy64.4Unverified
6LisGANaverage top-1 classification accuracy61.7Unverified
7TCNaverage top-1 classification accuracy61.5Unverified
8f-CLSWGANaverage top-1 classification accuracy60.8Unverified
9Cycle-WGANaverage top-1 classification accuracy59.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy86.4Unverified
2ZSL-KGaverage top-1 classification accuracy78.08Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy72.2Unverified
4DUET (Ours)average top-1 classification accuracy69.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy84Unverified
2ZLaP*Accuracy83.1Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy93.6Unverified
2ZLaPAccuracy93.4Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy74.2Unverified
2ZLaPAccuracy74Unverified
#ModelMetricClaimedVerifiedStatus
1ViT-B/16Average mAP60.17Unverified
2ResNet-50Average mAP56.19Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy51.2Unverified
2ZLaP*Accuracy51Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy29.1Unverified
2ZLaP*Accuracy29Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy75.9Unverified
2ZLaP*Accuracy75.5Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy87.9Unverified
2ZLaPAccuracy87.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPTop 1 Accuracy72.1Unverified
2ZLaP*Top 1 Accuracy72.1Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy21.7Unverified
2HiTeAAccuracy0.46Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy37.4Unverified
2HiTeAAccuracy0.56Unverified
#ModelMetricClaimedVerifiedStatus
1SPOTaverage top-1 classification accuracy71.9Unverified
2ZSL_TF-VAEGANaverage top-1 classification accuracy70.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy90Unverified
2ZLaP*Accuracy89Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.8Unverified
2ZLaPAccuracy71.2Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.4Unverified
2ZLaPAccuracy71Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy76.3Unverified
2ZLaPAccuracy76.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP(ViT-B/16)Average mAP85.77Unverified
2CLIP(ResNet-50)Average mAP84.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGTop-160.54Unverified
#ModelMetricClaimedVerifiedStatus
1zsl_ADAAverage Per-Class Accuracy70.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy63.2Unverified
#ModelMetricClaimedVerifiedStatus
1MSDAPearson correlation coefficient (PCC)0.52Unverified
#ModelMetricClaimedVerifiedStatus
1SeViLAAccuracy72.3Unverified
#ModelMetricClaimedVerifiedStatus
1M^2-EncoderAccuracy80.7Unverified
#ModelMetricClaimedVerifiedStatus
1FrozenBiLMAccuracy51.5Unverified
#ModelMetricClaimedVerifiedStatus
1CZSLA-acc36Unverified
#ModelMetricClaimedVerifiedStatus
1ZS3Netk=10 mIOU26.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGAccuracy88.98Unverified
#ModelMetricClaimedVerifiedStatus
1VideoChat2Accuracy40.6Unverified