Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples
Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/google-research/meta-datasetOfficialIn papertf★ 798
- github.com/cambridge-mlg/cnapspytorch★ 162
- github.com/peymanbateni/simple-cnapspytorch★ 120
- github.com/mboudiaf/pytorch-meta-datasetpytorch★ 59
- github.com/plai-group/simple-cnapstf★ 58
- github.com/nobody-1617/detapytorch★ 17
- github.com/tmlr-group/CoPAtf★ 10
- github.com/jimzai/detapytorch★ 6
- github.com/tmlr-group/mokdtf★ 4
- github.com/gokyeongryeol/MAHApytorch★ 1
Abstract
Few-shot classification refers to learning a classifier for new classes given only a few examples. While a plethora of models have emerged to tackle it, we find the procedure and datasets that are used to assess their progress lacking. To address this limitation, we propose Meta-Dataset: a new benchmark for training and evaluating models that is large-scale, consists of diverse datasets, and presents more realistic tasks. We experiment with popular baselines and meta-learners on Meta-Dataset, along with a competitive method that we propose. We analyze performance as a function of various characteristics of test tasks and examine the models' ability to leverage diverse training sources for improving their generalization. We also propose a new set of baselines for quantifying the benefit of meta-learning in Meta-Dataset. Our extensive experimentation has uncovered important research challenges and we hope to inspire work in these directions.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Meta-Dataset | fo-Proto-MAML | Accuracy | 63.43 | — | Unverified |
| Meta-Dataset | Finetune | Accuracy | 58.76 | — | Unverified |
| Meta-Dataset | k-NN | Accuracy | 54.32 | — | Unverified |
| Meta-Dataset Rank | fo-Proto-MAML | Mean Rank | 6.65 | — | Unverified |
| Meta-Dataset Rank | Finetune | Mean Rank | 8.7 | — | Unverified |
| Meta-Dataset Rank | k-NN | Mean Rank | 10.85 | — | Unverified |