SOTAVerified

HyperShot: Few-Shot Learning by Kernel HyperNetworks

2022-03-21Code Available1· sign in to hype

Marcin Sendera, Marcin Przewięźlikowski, Konrad Karanowski, Maciej Zięba, Jacek Tabor, Przemysław Spurek

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Few-shot models aim at making predictions using a minimal number of labeled examples from a given task. The main challenge in this area is the one-shot setting where only one element represents each class. We propose HyperShot - the fusion of kernels and hypernetwork paradigm. Compared to reference approaches that apply a gradient-based adjustment of the parameters, our model aims to switch the classification module parameters depending on the task's embedding. In practice, we utilize a hypernetwork, which takes the aggregated information from support data and returns the classifier's parameters handcrafted for the considered problem. Moreover, we introduce the kernel-based representation of the support examples delivered to hypernetwork to create the parameters of the classification module. Consequently, we rely on relations between embeddings of the support examples instead of direct feature values provided by the backbone models. Thanks to this approach, our model can adapt to highly different tasks.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CUB 200 5-way 1-shotHyperShotAccuracy66.13Unverified
CUB 200 5-way 5-shotHyperShotAccuracy80.07Unverified
Mini-ImageNet - 1-Shot LearningHyperShotAccuracy53.18Unverified
Mini-Imagenet 5-way (5-shot)HyperShotAccuracy69.62Unverified
Mini-ImageNet-CUB 5-way (1-shot)HyperShotAccuracy40.03Unverified
Mini-ImageNet-CUB 5-way (5-shot)HyperShotAccuracy58.86Unverified
OMNIGLOT-EMNIST 5-way (1-shot)HyperShotAccuracy80.65Unverified
OMNIGLOT-EMNIST 5-way (5-shot)HyperShotAccuracy90.81Unverified

Reproductions