Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn, Pieter Abbeel, Sergey Levine
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/cbfinn/maml_rlOfficialIn papertf★ 0
- github.com/cbfinn/mamlOfficialIn papertf★ 0
- github.com/learnables/learn2learnpytorch★ 2,878
- github.com/leopard-ai/bettypytorch★ 346
- github.com/JWSoh/MZSRtf★ 277
- github.com/fmu2/PyTorch-MAMLpytorch★ 247
- github.com/shaohua0116/MultiDigitMNISTnone★ 103
- github.com/prajjwal1/fluencepytorch★ 70
- github.com/MoritzTaylor/maml-rl-tf2tf★ 27
- github.com/mikehuisman/revisiting-learned-optimizerspytorch★ 5
Abstract
We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning. The goal of meta-learning is to train a model on a variety of learning tasks, such that it can solve new learning tasks using only a small number of training samples. In our approach, the parameters of the model are explicitly trained such that a small number of gradient steps with a small amount of training data from a new task will produce good generalization performance on that task. In effect, our method trains the model to be easy to fine-tune. We demonstrate that this approach leads to state-of-the-art performance on two few-shot image classification benchmarks, produces good results on few-shot regression, and accelerates fine-tuning for policy gradient reinforcement learning with neural network policies.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Dirichlet Mini-Imagenet (5-way, 1-shot) | MAML | 1:1 Accuracy | 47.6 | — | Unverified |
| Dirichlet Mini-Imagenet (5-way, 5-shot) | MAML | 1:1 Accuracy | 64.5 | — | Unverified |
| Meta-Dataset | fo-MAML | Accuracy | 57.02 | — | Unverified |
| Meta-Dataset Rank | fo-MAML | Mean Rank | 10.25 | — | Unverified |
| Mini-Imagenet 10-way (1-shot) | MAML | Accuracy | 31.3 | — | Unverified |
| Mini-Imagenet 10-way (1-shot) | MAML + Transduction | Accuracy | 31.8 | — | Unverified |
| Mini-Imagenet 10-way (5-shot) | MAML + Transduction | Accuracy | 48.2 | — | Unverified |
| Mini-Imagenet 10-way (5-shot) | MAML | Accuracy | 46.9 | — | Unverified |
| Mini-Imagenet 5-way (1-shot) | MAML | Accuracy | 48.7 | — | Unverified |
| Mini-Imagenet 5-way (5-shot) | MAML | Accuracy | 63.1 | — | Unverified |
| Mini-ImageNet-CUB 5-way (1-shot) | MAML (Finn et al., 2017) | Accuracy | 40.15 | — | Unverified |
| OMNIGLOT - 1-Shot, 5-way | MAML | Accuracy | 98.7 | — | Unverified |
| OMNIGLOT - 5-Shot, 5-way | MAML | Accuracy | 99.9 | — | Unverified |
| Tiered ImageNet 10-way (1-shot) | MAML + Transduction | Accuracy | 34.8 | — | Unverified |
| Tiered ImageNet 10-way (1-shot) | MAML | Accuracy | 34.4 | — | Unverified |
| Tiered ImageNet 10-way (5-shot) | MAML | Accuracy | 53.3 | — | Unverified |
| Tiered ImageNet 10-way (5-shot) | MAML + Transduction | Accuracy | 54.7 | — | Unverified |