HyperShot: Few-Shot Learning by Kernel HyperNetworks

2022-03-21Code Available1· sign in to hype

Marcin Sendera, Marcin Przewięźlikowski, Konrad Karanowski, Maciej Zięba, Jacek Tabor, Przemysław Spurek

Code Available — Be the first to reproduce this paper.

Code

github.com/gmum/few-shot-hypernets-public
Officialpytorch★ 28

Abstract

Few-shot models aim at making predictions using a minimal number of labeled examples from a given task. The main challenge in this area is the one-shot setting where only one element represents each class. We propose HyperShot - the fusion of kernels and hypernetwork paradigm. Compared to reference approaches that apply a gradient-based adjustment of the parameters, our model aims to switch the classification module parameters depending on the task's embedding. In practice, we utilize a hypernetwork, which takes the aggregated information from support data and returns the classifier's parameters handcrafted for the considered problem. Moreover, we introduce the kernel-based representation of the support examples delivered to hypernetwork to create the parameters of the classification module. Consequently, we rely on relations between embeddings of the support examples instead of direct feature values provided by the backbone models. Thanks to this approach, our model can adapt to highly different tasks.

Tasks

Few-Shot Image Classification Few-Shot Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CUB 200 5-way 1-shot	HyperShot	Accuracy	66.13	—	Unverified
CUB 200 5-way 5-shot	HyperShot	Accuracy	80.07	—	Unverified
Mini-ImageNet - 1-Shot Learning	HyperShot	Accuracy	53.18	—	Unverified
Mini-Imagenet 5-way (5-shot)	HyperShot	Accuracy	69.62	—	Unverified
Mini-ImageNet-CUB 5-way (1-shot)	HyperShot	Accuracy	40.03	—	Unverified
Mini-ImageNet-CUB 5-way (5-shot)	HyperShot	Accuracy	58.86	—	Unverified
OMNIGLOT-EMNIST 5-way (1-shot)	HyperShot	Accuracy	80.65	—	Unverified
OMNIGLOT-EMNIST 5-way (5-shot)	HyperShot	Accuracy	90.81	—	Unverified

HyperShot: Few-Shot Learning by Kernel HyperNetworks

Code

Abstract

Tasks

Benchmark Results

Reproductions