SOTAVerified

Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Further readings:

Papers

Showing 701750 of 1864 papers

TitleStatusHype
Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts0
Evaluating Vision-Language Models for Zero-Shot Detection, Classification, and Association of Motorcycles, Passengers, and Helmets0
Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings0
Geometric Algebra Meets Large Language Models: Instruction-Based Transformations of Separate Meshes in 3D, Interactive and Controllable Scenes0
AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate DiagnosisCode0
Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General FrameworkCode0
EZSR: Event-based Zero-Shot Recognition0
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian0
Generative Diffusion Model Bootstraps Zero-shot Classification of Fetal Ultrasound Images In Underrepresented African PopulationsCode0
I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognitionCode0
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot LearningCode0
Rethinking Domain Adaptation and Generalization in the Era of CLIP0
Multi-modal Relation Distillation for Unified 3D Representation Learning0
Zero-Shot Underwater Gesture RecognitionCode0
An Application of Large Language Models to Coding Negotiation Transcripts0
Audio-visual Generalized Zero-shot Learning the Easy Way0
Attention Based Simple Primitives for Open World Compositional Zero-Shot LearningCode0
Compound Expression Recognition via Multi Model Ensemble for the ABAW7 Challenge0
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion MapCode0
Codebook LLMs: Evaluating LLMs as Measurement Tools for Political Science Concepts0
Anticipating Future Object Compositions without Forgetting0
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding0
PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models0
Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning0
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging0
Malicious Path Manipulations via Exploitation of Representation Vulnerabilities of Vision-Language Navigation Systems0
Towards a text-based quantitative and explainable histopathology image analysisCode0
DuInNet: Dual-Modality Feature Interaction for Point Cloud Completion0
Pseudo-triplet Guided Few-shot Composed Image Retrieval0
Semantic Compositions Enhance Vision-Language Contrastive Learning0
At First Sight: Zero-Shot Classification of Astronomical Images with Large Multimodal Models0
Evaluation of Language Models in the Medical Context Under Resource-Constrained SettingsCode0
Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain0
A Simple Framework for Open-Vocabulary Zero-Shot Segmentation0
Serial Position Effects of Large Language Models0
Contextual Interaction via Primitive-based Adversarial Training For Compositional Zero-shot LearningCode0
CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned RepresentationCode0
A Data-Driven Guided Decoding Mechanism for Diagnostic CaptioningCode0
Factual Dialogue Summarization via Learning from Large Language Models0
Using Multimodal Large Language Models for Automated Detection of Traffic Safety Critical Events0
FuseGen: PLM Fusion for Data-generation based Zero-shot LearningCode0
MAC: A Benchmark for Multiple Attributes Compositional Zero-Shot Learning0
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM0
Zero-Shot Learning Over Large Output Spaces : Utilizing Indirect Knowledge Extraction from Large Language Models0
Understanding Visual Concepts Across ModelsCode0
BAMO at SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common SenseCode0
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning0
Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models0
SLANT: Spurious Logo ANalysis Toolkit0
Multi-Modal Generative Embedding Model0
Show:102550
← PrevPage 15 of 38Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy87.5Unverified
2DUETaverage top-1 classification accuracy72.3Unverified
3Composeraverage top-1 classification accuracy69.4Unverified
4HDC-ZSC-MLPaverage top-1 classification accuracy65.6Unverified
5ZSL_TF-VAEGANaverage top-1 classification accuracy64.9Unverified
6ZLaPAccuracy64.3Unverified
7ZLaP*Accuracy64.2Unverified
8HDC-ZSCaverage top-1 classification accuracy63.8Unverified
9SPOTaverage top-1 classification accuracy62.9Unverified
10f-VAEGAN-D2average top-1 classification accuracy61Unverified
#ModelMetricClaimedVerifiedStatus
1dmis-lab/biobert-v1.1Accuracy26.15Unverified
2meta-llama/Meta-Llama-3-8B-InstructAccuracy25.84Unverified
3epfl-llm/meditron-7bAccuracy25.75Unverified
4dmis-lab/meerkat-7b-v1.0Accuracy25.68Unverified
5meta-llama/Meta-Llama-3-8B-InstructAccuracy25.65Unverified
6HuggingFaceH4/zephyr-7b-betaAccuracy25.54Unverified
7dmis-lab/biobert-v1.1Accuracy25.46Unverified
8epfl-llm/meditron-70bAccuracy25.36Unverified
9epfl-llm/meditron-70bAccuracy25.26Unverified
10HuggingFaceH4/zephyr-7b-betaAccuracy25.06Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy77.3Unverified
2SPOT (VAEGAN)average top-1 classification accuracy66.04Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy66Unverified
4f-VAEGANaverage top-1 classification accuracy64.7Unverified
5DUET (Ours)average top-1 classification accuracy64.4Unverified
6LisGANaverage top-1 classification accuracy61.7Unverified
7TCNaverage top-1 classification accuracy61.5Unverified
8f-CLSWGANaverage top-1 classification accuracy60.8Unverified
9Cycle-WGANaverage top-1 classification accuracy59.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy86.4Unverified
2ZSL-KGaverage top-1 classification accuracy78.08Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy72.2Unverified
4DUET (Ours)average top-1 classification accuracy69.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy84Unverified
2ZLaP*Accuracy83.1Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy93.6Unverified
2ZLaPAccuracy93.4Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy74.2Unverified
2ZLaPAccuracy74Unverified
#ModelMetricClaimedVerifiedStatus
1ViT-B/16Average mAP60.17Unverified
2ResNet-50Average mAP56.19Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy51.2Unverified
2ZLaP*Accuracy51Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy29.1Unverified
2ZLaP*Accuracy29Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy75.9Unverified
2ZLaP*Accuracy75.5Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy87.9Unverified
2ZLaPAccuracy87.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPTop 1 Accuracy72.1Unverified
2ZLaP*Top 1 Accuracy72.1Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy21.7Unverified
2HiTeAAccuracy0.46Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy37.4Unverified
2HiTeAAccuracy0.56Unverified
#ModelMetricClaimedVerifiedStatus
1SPOTaverage top-1 classification accuracy71.9Unverified
2ZSL_TF-VAEGANaverage top-1 classification accuracy70.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy90Unverified
2ZLaP*Accuracy89Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.8Unverified
2ZLaPAccuracy71.2Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.4Unverified
2ZLaPAccuracy71Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy76.3Unverified
2ZLaPAccuracy76.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP(ViT-B/16)Average mAP85.77Unverified
2CLIP(ResNet-50)Average mAP84.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGTop-160.54Unverified
#ModelMetricClaimedVerifiedStatus
1zsl_ADAAverage Per-Class Accuracy70.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy63.2Unverified
#ModelMetricClaimedVerifiedStatus
1MSDAPearson correlation coefficient (PCC)0.52Unverified
#ModelMetricClaimedVerifiedStatus
1SeViLAAccuracy72.3Unverified
#ModelMetricClaimedVerifiedStatus
1M^2-EncoderAccuracy80.7Unverified
#ModelMetricClaimedVerifiedStatus
1FrozenBiLMAccuracy51.5Unverified
#ModelMetricClaimedVerifiedStatus
1CZSLA-acc36Unverified
#ModelMetricClaimedVerifiedStatus
1ZS3Netk=10 mIOU26.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGAccuracy88.98Unverified
#ModelMetricClaimedVerifiedStatus
1VideoChat2Accuracy40.6Unverified