SOTAVerified

Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Further readings:

Papers

Showing 801850 of 1864 papers

TitleStatusHype
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense RetrievalCode1
MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and TextsCode1
ULIP-2: Towards Scalable Multimodal Pre-training for 3D UnderstandingCode2
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception0
Dual Intent Enhanced Graph Neural Network for Session-based New Item RecommendationCode1
ImageBind: One Embedding Space To Bind Them AllCode5
Boosting Visual-Language Models by Exploiting Hard SamplesCode0
Unlocking Practical Applications in Legal Domain: Evaluation of GPT for Zero-Shot Semantic Annotation of Legal Texts0
Analyzing Hong Kong's Legal Judgments from a Computational Linguistics point-of-view0
LLM2Loss: Leveraging Language Models for Explainable Model Diagnostics0
The Benefits of Label-Description Training for Zero-Shot Text ClassificationCode0
Unsupervised Improvement of Audio-Text Cross-Modal RepresentationsCode0
Stance Detection: A Practical Guide to Classifying Political Beliefs in TextCode1
DRPT: Disentangled and Recurrent Prompt Tuning for Compositional Zero-Shot Learning0
Self-similarity-based super-resolution of photoacoustic angiography from hand-drawn doodlesCode1
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data IntegrationCode1
Company classification using zero-shot learning0
ZeroSearch: Local Image Search from Text with Zero Shot LearningCode0
The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification TasksCode1
Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLMCode1
RPLKG: Robust Prompt Learning with Knowledge Graph0
Information Extraction from Documents: Question Answering vs Token Classification in real-world setups0
CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information RetrievalCode0
Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation ModelsCode1
Segment Anything Model for Medical Image Analysis: an Experimental StudyCode1
LLM as A Robotic Brain: Unifying Egocentric Memory and Control0
Tailoring Domain Adaptation for Machine Translation Quality EstimationCode0
TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic SegmentationCode1
WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization RetrievalCode0
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)Code1
On the Opportunities and Challenges of Foundation Models for Geospatial Artificial Intelligence0
What does CLIP know about a red circle? Visual prompt engineering for VLMs0
ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitter Messages with Zero-Shot Learning0
RECLIP: Resource-efficient CLIP by Training with Small Images0
A Closer Look at the Explainability of Contrastive Language-Image Pre-trainingCode1
ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning0
SAM.MD: Zero-shot medical image segmentation capabilities of the Segment Anything Model0
Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action DetectionCode1
Synthetic Sample Selection for Generalized Zero-Shot Learning0
Structured prompt interrogation and recursive extraction of semantics (SPIRES): A method for populating knowledge bases using zero-shot learningCode2
Zero-shot Medical Image Translation via Frequency-Guided Diffusion ModelsCode1
Learning to Name Classes for Vision and Language Models0
Exploring Vision-Language Models for Imbalanced LearningCode1
AutoLabel: CLIP-based framework for Open-set Video Domain AdaptationCode1
SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger0
Zero-shot Entailment of Leaderboards for Empirical AI Research0
Your Diffusion Model is Secretly a Zero-Shot ClassifierCode2
Evaluation of ChatGPT for NLP-based Mental Health Applications0
Variational Distribution Learning for Unsupervised Text-to-Image Generation0
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot LearningCode1
Show:102550
← PrevPage 17 of 38Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy87.5Unverified
2DUETaverage top-1 classification accuracy72.3Unverified
3Composeraverage top-1 classification accuracy69.4Unverified
4HDC-ZSC-MLPaverage top-1 classification accuracy65.6Unverified
5ZSL_TF-VAEGANaverage top-1 classification accuracy64.9Unverified
6ZLaPAccuracy64.3Unverified
7ZLaP*Accuracy64.2Unverified
8HDC-ZSCaverage top-1 classification accuracy63.8Unverified
9SPOTaverage top-1 classification accuracy62.9Unverified
10f-VAEGAN-D2average top-1 classification accuracy61Unverified
#ModelMetricClaimedVerifiedStatus
1dmis-lab/biobert-v1.1Accuracy26.15Unverified
2meta-llama/Meta-Llama-3-8B-InstructAccuracy25.84Unverified
3epfl-llm/meditron-7bAccuracy25.75Unverified
4dmis-lab/meerkat-7b-v1.0Accuracy25.68Unverified
5meta-llama/Meta-Llama-3-8B-InstructAccuracy25.65Unverified
6HuggingFaceH4/zephyr-7b-betaAccuracy25.54Unverified
7dmis-lab/biobert-v1.1Accuracy25.46Unverified
8epfl-llm/meditron-70bAccuracy25.36Unverified
9epfl-llm/meditron-70bAccuracy25.26Unverified
10HuggingFaceH4/zephyr-7b-betaAccuracy25.06Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy77.3Unverified
2SPOT (VAEGAN)average top-1 classification accuracy66.04Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy66Unverified
4f-VAEGANaverage top-1 classification accuracy64.7Unverified
5DUET (Ours)average top-1 classification accuracy64.4Unverified
6LisGANaverage top-1 classification accuracy61.7Unverified
7TCNaverage top-1 classification accuracy61.5Unverified
8f-CLSWGANaverage top-1 classification accuracy60.8Unverified
9Cycle-WGANaverage top-1 classification accuracy59.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy86.4Unverified
2ZSL-KGaverage top-1 classification accuracy78.08Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy72.2Unverified
4DUET (Ours)average top-1 classification accuracy69.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy84Unverified
2ZLaP*Accuracy83.1Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy93.6Unverified
2ZLaPAccuracy93.4Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy74.2Unverified
2ZLaPAccuracy74Unverified
#ModelMetricClaimedVerifiedStatus
1ViT-B/16Average mAP60.17Unverified
2ResNet-50Average mAP56.19Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy51.2Unverified
2ZLaP*Accuracy51Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy29.1Unverified
2ZLaP*Accuracy29Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy75.9Unverified
2ZLaP*Accuracy75.5Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy87.9Unverified
2ZLaPAccuracy87.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPTop 1 Accuracy72.1Unverified
2ZLaP*Top 1 Accuracy72.1Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy21.7Unverified
2HiTeAAccuracy0.46Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy37.4Unverified
2HiTeAAccuracy0.56Unverified
#ModelMetricClaimedVerifiedStatus
1SPOTaverage top-1 classification accuracy71.9Unverified
2ZSL_TF-VAEGANaverage top-1 classification accuracy70.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy90Unverified
2ZLaP*Accuracy89Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.8Unverified
2ZLaPAccuracy71.2Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.4Unverified
2ZLaPAccuracy71Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy76.3Unverified
2ZLaP*Accuracy76.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP(ViT-B/16)Average mAP85.77Unverified
2CLIP(ResNet-50)Average mAP84.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGTop-160.54Unverified
#ModelMetricClaimedVerifiedStatus
1zsl_ADAAverage Per-Class Accuracy70.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy63.2Unverified
#ModelMetricClaimedVerifiedStatus
1MSDAPearson correlation coefficient (PCC)0.52Unverified
#ModelMetricClaimedVerifiedStatus
1SeViLAAccuracy72.3Unverified
#ModelMetricClaimedVerifiedStatus
1M^2-EncoderAccuracy80.7Unverified
#ModelMetricClaimedVerifiedStatus
1FrozenBiLMAccuracy51.5Unverified
#ModelMetricClaimedVerifiedStatus
1CZSLA-acc36Unverified
#ModelMetricClaimedVerifiedStatus
1ZS3Netk=10 mIOU26.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGAccuracy88.98Unverified
#ModelMetricClaimedVerifiedStatus
1VideoChat2Accuracy40.6Unverified