SOTAVerified

Zero-Shot Learning

Zero-shot learning (ZSL) is a model's ability to detect classes never seen during training. The condition is that the classes are not known during supervised learning.

Earlier work in zero-shot learning use attributes in a two-step approach to infer unknown classes. In the computer vision context, more recent advances learn mappings from image feature space to semantic space. Other approaches learn non-linear multimodal embeddings. In the modern NLP context, language models can be evaluated on downstream tasks without fine tuning.

Benchmark datasets for zero-shot learning include aPY, AwA, and CUB, among others.

( Image credit: Prototypical Networks for Few shot Learning in PyTorch )

Further readings:

Papers

Showing 551600 of 1864 papers

TitleStatusHype
Enhancing medical vision-language contrastive learning via inter-matching relation modelling0
Exploiting GPT-4 Vision for Zero-shot Point Cloud Understanding0
Foundation Models for Biomedical Image Segmentation: A Survey0
3D Object Detection and High-Resolution Traffic Parameters Extraction Using Low-Resolution LiDAR Data0
Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context LearningCode1
SamLP: A Customized Segment Anything Model for License Plate DetectionCode1
Can Active Label Correction Improve LLM-based Modular AI Systems?0
Modality-Aware Representation Learning for Zero-shot Sketch-based Image RetrievalCode0
CLIP-Guided Source-Free Object Detection in Aerial ImagesCode1
TwinBooster: Synergising Large Language Models with Barlow Twins and Gradient Boosting for Enhanced Molecular Property PredictionCode0
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes InteractivelyCode5
Benchmarking PathCLIP for Pathology Image Analysis0
Zero-shot Microclimate Prediction with Deep Learning0
Improved Zero-Shot Classification by Adapting VLMs with Text DescriptionsCode1
Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification0
Beyond Seen Primitive Concepts and Attribute-Object Compositional Learning0
Towards High-fidelity Artistic Image Vectorization via Texture-Encapsulated Shape Parameterization0
Building Vision-Language Models on Solid Foundations with Masked Distillation0
Improving Generalized Zero-Shot Learning by Exploring the Diverse Semantics from External Class NamesCode1
Diffusion Models, Image Super-Resolution And Everything: A Survey0
Pushing Boundaries: Exploring Zero Shot Object Classification with Large Multimodal Models0
AI Content Self-Detection for Transformer-based Large Language Models0
FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with Pre-trained Vision-Language ModelsCode1
Revealing the Proximate Long-Tail Distribution in Compositional Zero-Shot LearningCode0
A Prompt Learning Framework for Source Code SummarizationCode1
On the Promises and Challenges of Multimodal Foundation Models for Geographical, Environmental, Agricultural, and Urban Planning Applications0
Do LLM Agents Exhibit Social Behavior?0
Meta Transfer of Self-Supervised Knowledge: Foundation Model in Action for Post-Traumatic Epilepsy Prediction0
Compositional Zero-Shot Learning for Attribute-Based Object Reference in Human-Robot Interaction0
SEER-ZSL: Semantic Encoder-Enhanced Representations for Generalized Zero-Shot Learning0
ParsNets: A Parsimonious Orthogonal and Low-Rank Linear Networks for Zero-Shot Learning0
CLIP-guided Federated Learning on Heterogeneous and Long-Tailed DataCode1
EZ-CLIP: Efficient Zeroshot Video Action RecognitionCode1
CSI-Based Cross-Domain Activity Recognition via Zero-Shot Prototypical Networks0
Open-Pose 3D Zero-Shot Learning: Benchmark and ChallengesCode1
A Review of Machine Learning Methods Applied to Video Analysis Systems0
Physical-Layer Semantic-Aware Network for Zero-Shot Wireless Sensing0
The Potential of Vision-Language Models for Content Moderation of Children's Videos0
Lite-Mind: Towards Efficient and Robust Brain Representation NetworkCode1
SCLIP: Rethinking Self-Attention for Dense Vision-Language InferenceCode1
CILF-CIAE: CLIP-driven Image-Language Fusion for Correcting Inverse Age Estimation0
ArabIcros: AI-Powered Arabic Crossword Puzzle Generation for Educational Applications0
Large Language Models Are Zero-Shot Text ClassifiersCode1
Meta-Learned Attribute Self-Interaction Network for Continual and Generalized Zero-Shot Learning0
Prompt Tuning for Zero-shot Compositional Learning0
Pipeline Enabling Zero-shot Classification for Bangla Handwritten Grapheme0
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models0
Applying Large Language Models and Chain-of-Thought for Automatic Scoring0
Class Distribution Shifts in Zero-Shot Learning: Learning Robust RepresentationsCode0
Explaining CLIP's performance disparities on data from blind/low vision users0
Show:102550
← PrevPage 12 of 38Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy87.5Unverified
2DUETaverage top-1 classification accuracy72.3Unverified
3Composeraverage top-1 classification accuracy69.4Unverified
4HDC-ZSC-MLPaverage top-1 classification accuracy65.6Unverified
5ZSL_TF-VAEGANaverage top-1 classification accuracy64.9Unverified
6ZLaPAccuracy64.3Unverified
7ZLaP*Accuracy64.2Unverified
8HDC-ZSCaverage top-1 classification accuracy63.8Unverified
9SPOTaverage top-1 classification accuracy62.9Unverified
10f-VAEGAN-D2average top-1 classification accuracy61Unverified
#ModelMetricClaimedVerifiedStatus
1dmis-lab/biobert-v1.1Accuracy26.15Unverified
2meta-llama/Meta-Llama-3-8B-InstructAccuracy25.84Unverified
3epfl-llm/meditron-7bAccuracy25.75Unverified
4dmis-lab/meerkat-7b-v1.0Accuracy25.68Unverified
5meta-llama/Meta-Llama-3-8B-InstructAccuracy25.65Unverified
6HuggingFaceH4/zephyr-7b-betaAccuracy25.54Unverified
7dmis-lab/biobert-v1.1Accuracy25.46Unverified
8epfl-llm/meditron-70bAccuracy25.36Unverified
9epfl-llm/meditron-70bAccuracy25.26Unverified
10HuggingFaceH4/zephyr-7b-betaAccuracy25.06Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy77.3Unverified
2SPOT (VAEGAN)average top-1 classification accuracy66.04Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy66Unverified
4f-VAEGANaverage top-1 classification accuracy64.7Unverified
5DUET (Ours)average top-1 classification accuracy64.4Unverified
6LisGANaverage top-1 classification accuracy61.7Unverified
7TCNaverage top-1 classification accuracy61.5Unverified
8f-CLSWGANaverage top-1 classification accuracy60.8Unverified
9Cycle-WGANaverage top-1 classification accuracy59.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZeroDiffaverage top-1 classification accuracy86.4Unverified
2ZSL-KGaverage top-1 classification accuracy78.08Unverified
3ZSL_TF-VAEGANaverage top-1 classification accuracy72.2Unverified
4DUET (Ours)average top-1 classification accuracy69.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy84Unverified
2ZLaP*Accuracy83.1Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy93.6Unverified
2ZLaPAccuracy93.4Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy74.2Unverified
2ZLaPAccuracy74Unverified
#ModelMetricClaimedVerifiedStatus
1ViT-B/16Average mAP60.17Unverified
2ResNet-50Average mAP56.19Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy51.2Unverified
2ZLaP*Accuracy51Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy29.1Unverified
2ZLaP*Accuracy29Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy75.9Unverified
2ZLaP*Accuracy75.5Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy87.9Unverified
2ZLaPAccuracy87.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPTop 1 Accuracy72.1Unverified
2ZLaP*Top 1 Accuracy72.1Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy21.7Unverified
2HiTeAAccuracy0.46Unverified
#ModelMetricClaimedVerifiedStatus
1HiTeAAccuracy37.4Unverified
2HiTeAAccuracy0.56Unverified
#ModelMetricClaimedVerifiedStatus
1SPOTaverage top-1 classification accuracy71.9Unverified
2ZSL_TF-VAEGANaverage top-1 classification accuracy70.8Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaPAccuracy90Unverified
2ZLaP*Accuracy89Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.8Unverified
2ZLaPAccuracy71.2Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy71.4Unverified
2ZLaPAccuracy71Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy76.3Unverified
2ZLaPAccuracy76.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP(ViT-B/16)Average mAP85.77Unverified
2CLIP(ResNet-50)Average mAP84.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGTop-160.54Unverified
#ModelMetricClaimedVerifiedStatus
1zsl_ADAAverage Per-Class Accuracy70.9Unverified
#ModelMetricClaimedVerifiedStatus
1ZLaP*Accuracy63.2Unverified
#ModelMetricClaimedVerifiedStatus
1MSDAPearson correlation coefficient (PCC)0.52Unverified
#ModelMetricClaimedVerifiedStatus
1SeViLAAccuracy72.3Unverified
#ModelMetricClaimedVerifiedStatus
1M^2-EncoderAccuracy80.7Unverified
#ModelMetricClaimedVerifiedStatus
1FrozenBiLMAccuracy51.5Unverified
#ModelMetricClaimedVerifiedStatus
1CZSLA-acc36Unverified
#ModelMetricClaimedVerifiedStatus
1ZS3Netk=10 mIOU26.3Unverified
#ModelMetricClaimedVerifiedStatus
1ZSL-KGAccuracy88.98Unverified
#ModelMetricClaimedVerifiedStatus
1VideoChat2Accuracy40.6Unverified