SOTAVerified

Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to recognize classes it never saw during training. The defining condition is that these classes are unknown during supervised learning, so no labeled examples of them are available to the model.

Earlier work in zero-shot learning uses attributes in a two-step approach to infer unknown classes. In computer vision, more recent advances learn mappings from image feature space to semantic space, while other approaches learn non-linear multimodal embeddings. In modern NLP, language models can be evaluated on downstream tasks without fine-tuning.
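The feature-to-semantic mapping idea can be illustrated with a minimal sketch. It assumes a linear map fitted by ridge regression and synthetic data; the function names, the `mixing` matrix, and the toy attribute signatures are all hypothetical and not taken from any of the listed papers. Published methods typically use deep image features and more elaborate objectives.

```python
import numpy as np


def fit_feature_to_semantic_map(features, signatures, reg=1.0):
    """Ridge regression from feature space to semantic (attribute) space.

    features:   (n_samples, n_features) array of seen-class samples
    signatures: (n_samples, n_attributes) array; each row is the attribute
                vector of that sample's class
    Returns W with shape (n_features, n_attributes).
    """
    d = features.shape[1]
    gram = features.T @ features + reg * np.eye(d)
    return np.linalg.solve(gram, features.T @ signatures)


def predict_unseen(features, W, class_signatures):
    """Project features into semantic space and return, for each sample,
    the index of the class signature with the highest cosine similarity."""
    projected = features @ W
    projected = projected / (np.linalg.norm(projected, axis=1, keepdims=True) + 1e-12)
    sigs = class_signatures / (
        np.linalg.norm(class_signatures, axis=1, keepdims=True) + 1e-12
    )
    return np.argmax(projected @ sigs.T, axis=1)


def demo(seed=0):
    """Toy experiment: train on four seen classes, test on two unseen ones."""
    rng = np.random.default_rng(seed)
    mixing = rng.normal(size=(4, 8))  # hypothetical attribute -> feature map
    seen = np.eye(4)                  # seen classes: one attribute each
    unseen = np.array([[1.0, 0.0, 1.0, 0.0],   # unseen classes: new
                       [0.0, 1.0, 0.0, 1.0]])  # attribute combinations

    def sample(signatures, per_class):
        labels = np.repeat(np.arange(len(signatures)), per_class)
        noise = 0.05 * rng.normal(size=(len(labels), 8))
        return signatures[labels] @ mixing + noise, labels

    X_seen, y_seen = sample(seen, 50)
    X_unseen, y_unseen = sample(unseen, 50)

    # Only seen-class data is used to fit the map.
    W = fit_feature_to_semantic_map(X_seen, seen[y_seen], reg=1.0)
    preds = predict_unseen(X_unseen, W, unseen)
    return float(np.mean(preds == y_unseen))


if __name__ == "__main__":
    print("top-1 accuracy on unseen classes:", demo())
```

The unseen classes never appear during fitting. They are recognized only because their attribute signatures live in the same semantic space that the map was trained to predict.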

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

(Image credit: Prototypical Networks for Few shot Learning in PyTorch)

Papers

Showing 51-100 of 1864 papers

Title | Status | Hype
Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning | - | 0
Advancing Email Spam Detection: Leveraging Zero-Shot Learning and Large Language Models | Code | 0
On the effectiveness of Large Language Models in the mechanical design domain | Code | 0
Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System | - | 0
Investigating Task Arithmetic for Zero-Shot Information Retrieval | Code | 0
DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Code | 1
Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization | - | 0
Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection | Code | 0
Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL | - | 0
Med-2D SegNet: A Light Weight Deep Neural Network for Medical 2D Image Segmentation | - | 0
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability | - | 0
MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning | - | 0
Structured Extraction of Process Structure Properties Relationships in Materials Science | - | 0
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Code | 1
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective | - | 0
UniFault: A Fault Diagnosis Foundation Model from Bearing Data | - | 0
Transductive One-Shot Learning Meet Subspace Decomposition | - | 0
GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition | Code | 1
SALT: A Flexible Semi-Automatic Labeling Tool for General LiDAR Point Clouds with Cross-Scene Adaptability and 4D Consistency | Code | 2
GenSwarm: Scalable Multi-Robot Code-Policy Generation and Deployment via Language Models | Code | 1
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization | - | 0
ViLAaD: Enhancing "Attracting and Dispersing" Source-Free Domain Adaptation with Vision-and-Language Model | - | 0
Visual and Semantic Prompt Collaboration for Generalized Zero-Shot Learning | - | 0
Large Language Models are Unreliable for Cyber Threat Intelligence | - | 0
Extremely Simple Out-of-distribution Detection for Audio-visual Generalized Zero-shot Learning | - | 0
fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models | - | 0
Enhancing Small Language Models for Cross-Lingual Generalized Zero-Shot Classification with Soft Prompt Tuning | - | 0
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection | - | 0
CAARMA: Class Augmentation with Adversarial Mixup Regularization | - | 0
Semantic Segmentation of Transparent and Opaque Drinking Glasses with the Help of Zero-shot Learning | - | 0
Sparseformer: a Transferable Transformer with Multi-granularity Token Sparsification for Medical Time Series Classification | - | 0
Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | - | 0
Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection | Code | 0
Advancing Medical Representation Learning Through High-Quality Data | Code | 1
Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning | Code | 0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Code | 0
An experimental approach on Few Shot Class Incremental Learning | - | 0
Systematic Classification of Studies Investigating Social Media Conversations about Long COVID Using a Novel Zero-Shot Transformer Framework | - | 0
Zero-TIG: Temporal Consistency-Aware Zero-Shot Illumination-Guided Low-light Video Enhancement | Code | 0
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images | - | 0
ChatGPT Encounters Morphing Attack Detection: Zero-Shot MAD with Multi-Modal Large Language Models and General Vision Models | - | 0
SVIP: Semantically Contextualized Visual Patches for Zero-Shot Learning | - | 0
Have LLMs Made Active Learning Obsolete? Surveying the NLP Community | - | 0
Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication | Code | 0
Controlling Latent Diffusion Using Latent CLIP | Code | 1
Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study | - | 0
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification | - | 0
Generative AI in Transportation Planning: A Survey | - | 0
A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding | - | 0
DiffCLIP: Differential Attention Meets CLIP | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ZeroDiff | average top-1 classification accuracy | 87.5 | - | Unverified
2 | DUET | average top-1 classification accuracy | 72.3 | - | Unverified
3 | Composer | average top-1 classification accuracy | 69.4 | - | Unverified
4 | HDC-ZSC-MLP | average top-1 classification accuracy | 65.6 | - | Unverified
5 | ZSL_TF-VAEGAN | average top-1 classification accuracy | 64.9 | - | Unverified
6 | ZLaP | Accuracy | 64.3 | - | Unverified
7 | ZLaP* | Accuracy | 64.2 | - | Unverified
8 | HDC-ZSC | average top-1 classification accuracy | 63.8 | - | Unverified
9 | SPOT | average top-1 classification accuracy | 62.9 | - | Unverified
10 | f-VAEGAN-D2 | average top-1 classification accuracy | 61 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | dmis-lab/biobert-v1.1 | Accuracy | 26.15 | - | Unverified
2 | meta-llama/Meta-Llama-3-8B-Instruct | Accuracy | 25.84 | - | Unverified
3 | epfl-llm/meditron-7b | Accuracy | 25.75 | - | Unverified
4 | dmis-lab/meerkat-7b-v1.0 | Accuracy | 25.68 | - | Unverified
5 | meta-llama/Meta-Llama-3-8B-Instruct | Accuracy | 25.65 | - | Unverified
6 | HuggingFaceH4/zephyr-7b-beta | Accuracy | 25.54 | - | Unverified
7 | dmis-lab/biobert-v1.1 | Accuracy | 25.46 | - | Unverified
8 | epfl-llm/meditron-70b | Accuracy | 25.36 | - | Unverified
9 | epfl-llm/meditron-70b | Accuracy | 25.26 | - | Unverified
10 | HuggingFaceH4/zephyr-7b-beta | Accuracy | 25.06 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZeroDiff | average top-1 classification accuracy | 77.3 | - | Unverified
2 | SPOT (VAEGAN) | average top-1 classification accuracy | 66.04 | - | Unverified
3 | ZSL_TF-VAEGAN | average top-1 classification accuracy | 66 | - | Unverified
4 | f-VAEGAN | average top-1 classification accuracy | 64.7 | - | Unverified
5 | DUET (Ours) | average top-1 classification accuracy | 64.4 | - | Unverified
6 | LisGAN | average top-1 classification accuracy | 61.7 | - | Unverified
7 | TCN | average top-1 classification accuracy | 61.5 | - | Unverified
8 | f-CLSWGAN | average top-1 classification accuracy | 60.8 | - | Unverified
9 | Cycle-WGAN | average top-1 classification accuracy | 59.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZeroDiff | average top-1 classification accuracy | 86.4 | - | Unverified
2 | ZSL-KG | average top-1 classification accuracy | 78.08 | - | Unverified
3 | ZSL_TF-VAEGAN | average top-1 classification accuracy | 72.2 | - | Unverified
4 | DUET (Ours) | average top-1 classification accuracy | 69.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP | Accuracy | 84 | - | Unverified
2 | ZLaP* | Accuracy | 83.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 93.6 | - | Unverified
2 | ZLaP | Accuracy | 93.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 74.2 | - | Unverified
2 | ZLaP | Accuracy | 74 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ViT-B/16 | Average mAP | 60.17 | - | Unverified
2 | ResNet-50 | Average mAP | 56.19 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP | Accuracy | 51.2 | - | Unverified
2 | ZLaP* | Accuracy | 51 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP | Accuracy | 29.1 | - | Unverified
2 | ZLaP* | Accuracy | 29 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP | Accuracy | 75.9 | - | Unverified
2 | ZLaP* | Accuracy | 75.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 87.9 | - | Unverified
2 | ZLaP | Accuracy | 87.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP | Top 1 Accuracy | 72.1 | - | Unverified
2 | ZLaP* | Top 1 Accuracy | 72.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HiTeA | Accuracy | 21.7 | - | Unverified
2 | HiTeA | Accuracy | 0.46 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HiTeA | Accuracy | 37.4 | - | Unverified
2 | HiTeA | Accuracy | 0.56 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SPOT | average top-1 classification accuracy | 71.9 | - | Unverified
2 | ZSL_TF-VAEGAN | average top-1 classification accuracy | 70.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP | Accuracy | 90 | - | Unverified
2 | ZLaP* | Accuracy | 89 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 71.8 | - | Unverified
2 | ZLaP | Accuracy | 71.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 71.4 | - | Unverified
2 | ZLaP | Accuracy | 71 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 76.3 | - | Unverified
2 | ZLaP | Accuracy | 76.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLIP(ViT-B/16) | Average mAP | 85.77 | - | Unverified
2 | CLIP(ResNet-50) | Average mAP | 84.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZSL-KG | Top-1 | 60.54 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | zsl_ADA | Average Per-Class Accuracy | 70.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZLaP* | Accuracy | 63.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MSDA | Pearson correlation coefficient (PCC) | 0.52 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SeViLA | Accuracy | 72.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | M^2-Encoder | Accuracy | 80.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FrozenBiLM | Accuracy | 51.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CZSL | A-acc | 36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZS3Net | k=10 mIOU | 26.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ZSL-KG | Accuracy | 88.98 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VideoChat2 | Accuracy | 40.6 | - | Unverified
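
Several of the tables above report "average top-1 classification accuracy" or "Average Per-Class Accuracy". One common reading of these names in the zero-shot literature is the mean of per-class top-1 accuracies, which stops frequent classes from dominating the score. The sketch below assumes that reading; the page does not state the exact protocol behind each entry, and the function name is hypothetical.

```python
from collections import defaultdict


def average_per_class_top1_accuracy(y_true, y_pred):
    """Mean of per-class top-1 accuracies (assumed definition, see lead-in).

    y_true, y_pred: equal-length sequences of class labels.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if len(y_true) == 0:
        raise ValueError("at least one sample is required")

    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1

    # Accuracy is computed within each true class, then averaged over classes.
    per_class = [correct[c] / total[c] for c in total]
    return sum(per_class) / len(per_class)


if __name__ == "__main__":
    # Class 0 has four samples (all correct); class 1 has one (missed).
    y_true = [0, 0, 0, 0, 1]
    y_pred = [0, 0, 0, 0, 0]
    print(average_per_class_top1_accuracy(y_true, y_pred))
```

In this imbalanced toy case, plain accuracy would be 4/5, while the per-class average is (1.0 + 0.0) / 2, which is why the two metrics should not be compared directly across leaderboards.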