Meta-Curvature
Eunbyung Park, Junier B. Oliva
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/silverbottlep/meta_curvatureOfficialIn papernone★ 0
Abstract
We propose meta-curvature (MC), a framework to learn curvature information for better generalization and fast model adaptation. MC expands on the model-agnostic meta-learner (MAML) by learning to transform the gradients in the inner optimization such that the transformed gradients achieve better generalization performance to a new task. For training large scale neural networks, we decompose the curvature matrix into smaller matrices in a novel scheme where we capture the dependencies of the model's parameters with a series of tensor products. We demonstrate the effects of our proposed method on several few-shot learning tasks and datasets. Without any task specific techniques and architectures, the proposed method achieves substantial improvement upon previous MAML variants and outperforms the recent state-of-the-art methods. Furthermore, we observe faster convergence rates of the meta-training process. Finally, we present an analysis that explains better generalization performance with the meta-trained curvature.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Mini-Imagenet 5-way (1-shot) | MC2+ | Accuracy | 55.73 | — | Unverified |
| Mini-Imagenet 5-way (5-shot) | MC2+ | Accuracy | 70.33 | — | Unverified |
| OMNIGLOT - 1-Shot, 20-way | MC2+ | Accuracy | 88 | — | Unverified |
| OMNIGLOT - 1-Shot, 5-way | MC2+ | Accuracy | 99.97 | — | Unverified |
| OMNIGLOT - 5-Shot, 20-way | MC2+ | Accuracy | 99.65 | — | Unverified |
| OMNIGLOT - 5-Shot, 5-way | MC2+ | Accuracy | 99.89 | — | Unverified |