Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

2017-03-09ICML 2017Code Available1· sign in to hype

Chelsea Finn, Pieter Abbeel, Sergey Levine

Code Available — Be the first to reproduce this paper.

Code

github.com/cbfinn/maml_rl
OfficialIn papertf★ 0
github.com/cbfinn/maml
OfficialIn papertf★ 0
github.com/JWSoh/MZSR
tf★ 277
github.com/fmu2/PyTorch-MAML
pytorch★ 247
github.com/shaohua0116/MultiDigitMNIST
none★ 103
github.com/yoonholee/MT-net
tf★ 38
github.com/MoritzTaylor/maml-rl-tf2
tf★ 27
github.com/mikehuisman/revisiting-learned-optimizers
pytorch★ 5
github.com/psh150204/MAML
pytorch★ 4
github.com/s-a-malik/multi-few
pytorch★ 4

Abstract

We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning. The goal of meta-learning is to train a model on a variety of learning tasks, such that it can solve new learning tasks using only a small number of training samples. In our approach, the parameters of the model are explicitly trained such that a small number of gradient steps with a small amount of training data from a new task will produce good generalization performance on that task. In effect, our method trains the model to be easy to fine-tune. We demonstrate that this approach leads to state-of-the-art performance on two few-shot image classification benchmarks, produces good results on few-shot regression, and accelerates fine-tuning for policy gradient reinforcement learning with neural network policies.

Tasks

Category-Agnostic Pose Estimation Few-Shot Image Classification Few-Shot Learning General Classification image-classification Image Classification Meta-Learning One-Shot Learning regression reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Dirichlet Mini-Imagenet (5-way, 1-shot)	MAML	1:1 Accuracy	47.6	—	Unverified
Dirichlet Mini-Imagenet (5-way, 5-shot)	MAML	1:1 Accuracy	64.5	—	Unverified
Meta-Dataset	fo-MAML	Accuracy	57.02	—	Unverified
Meta-Dataset Rank	fo-MAML	Mean Rank	10.25	—	Unverified
Mini-Imagenet 10-way (1-shot)	MAML	Accuracy	31.3	—	Unverified
Mini-Imagenet 10-way (1-shot)	MAML + Transduction	Accuracy	31.8	—	Unverified
Mini-Imagenet 10-way (5-shot)	MAML + Transduction	Accuracy	48.2	—	Unverified
Mini-Imagenet 10-way (5-shot)	MAML	Accuracy	46.9	—	Unverified
Mini-Imagenet 5-way (1-shot)	MAML	Accuracy	48.7	—	Unverified
Mini-Imagenet 5-way (5-shot)	MAML	Accuracy	63.1	—	Unverified
Mini-ImageNet-CUB 5-way (1-shot)	MAML (Finn et al., 2017)	Accuracy	40.15	—	Unverified
OMNIGLOT - 1-Shot, 5-way	MAML	Accuracy	98.7	—	Unverified
OMNIGLOT - 5-Shot, 5-way	MAML	Accuracy	99.9	—	Unverified
Tiered ImageNet 10-way (1-shot)	MAML + Transduction	Accuracy	34.8	—	Unverified
Tiered ImageNet 10-way (1-shot)	MAML	Accuracy	34.4	—	Unverified
Tiered ImageNet 10-way (5-shot)	MAML	Accuracy	53.3	—	Unverified
Tiered ImageNet 10-way (5-shot)	MAML + Transduction	Accuracy	54.7	—	Unverified

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

Code

Abstract

Tasks

Benchmark Results

Reproductions