Transductive Information Maximization For Few-Shot Learning

2020-08-25Code Available1· sign in to hype

Malik Boudiaf, Ziko Imtiaz Masud, Jérôme Rony, José Dolz, Pablo Piantanida, Ismail Ben Ayed

Code Available — Be the first to reproduce this paper.

Code

github.com/mboudiaf/TIM
OfficialIn paperpytorch★ 123

Abstract

We introduce Transductive Infomation Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. Furthermore, we propose a new alternating-direction solver for our mutual-information loss, which substantially speeds up transductive-inference convergence over gradient-based optimization, while yielding similar accuracy. TIM inference is modular: it can be used on top of any base-training feature extractor. Following standard transductive few-shot settings, our comprehensive experiments demonstrate that TIM outperforms state-of-the-art methods significantly across various datasets and networks, while used on top of a fixed feature extractor trained with simple cross-entropy on the base classes, without resorting to complex meta-learning schemes. It consistently brings between 2% and 5% improvement in accuracy over the best performing method, not only on all the well-established few-shot benchmarks but also on more challenging scenarios,with domain shifts and larger numbers of classes.

Tasks

Few-Shot Image Classification Few-Shot Learning Meta-Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CUB 200 5-way 1-shot	TIM-GD	Accuracy	82.2	—	Unverified
CUB 200 5-way 5-shot	TIM-GD	Accuracy	90.8	—	Unverified
Mini-Imagenet 10-way (1-shot)	TIM-GD	Accuracy	56.1	—	Unverified
Mini-Imagenet 10-way (5-shot)	TIM-GD	Accuracy	72.8	—	Unverified
Mini-Imagenet 20-way (1-shot)	TIM-GD	Accuracy	39.3	—	Unverified
Mini-Imagenet 20-way (5-shot)	TIM-GD	Accuracy	59.5	—	Unverified
Mini-Imagenet 5-way (1-shot)	TIM-GD	Accuracy	77.8	—	Unverified
Mini-ImageNet to CUB - 5 shot learning	TIM-GD	Accuracy	71	—	Unverified
Tiered ImageNet 5-way (1-shot)	TIM-GD	Accuracy	82.1	—	Unverified
Tiered ImageNet 5-way (5-shot)	TIM-GD	Accuracy	89.8	—	Unverified

Transductive Information Maximization For Few-Shot Learning

Code

Abstract

Tasks

Benchmark Results

Reproductions