AutoMix: Unveiling the Power of Mixup for Stronger Classifiers
Zicheng Liu, Siyuan Li, Di wu, Zihan Liu, ZhiYuan Chen, Lirong Wu, Stan Z. Li
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/Westlake-AI/openmixupOfficialIn paperpytorch★ 657
- github.com/zeyuanyin/tiny-imagenetpytorch★ 20
- github.com/Westlake-AI/AutoMixpytorch★ 18
Abstract
Data mixing augmentation have proved to be effective in improving the generalization ability of deep neural networks. While early methods mix samples by hand-crafted policies (e.g., linear interpolation), recent methods utilize saliency information to match the mixed samples and labels via complex offline optimization. However, there arises a trade-off between precise mixing policies and optimization complexity. To address this challenge, we propose a novel automatic mixup (AutoMix) framework, where the mixup policy is parameterized and serves the ultimate classification goal directly. Specifically, AutoMix reformulates the mixup classification into two sub-tasks (i.e., mixed sample generation and mixup classification) with corresponding sub-networks and solves them in a bi-level optimization framework. For the generation, a learnable lightweight mixup generator, Mix Block, is designed to generate mixed samples by modeling patch-wise relationships under the direct supervision of the corresponding mixed labels. To prevent the degradation and instability of bi-level optimization, we further introduce a momentum pipeline to train AutoMix in an end-to-end manner. Extensive experiments on nine image benchmarks prove the superiority of AutoMix compared with state-of-the-art in various classification scenarios and downstream tasks.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CIFAR-10 | ResNeXt-50 (AutoMix) | Percentage correct | 97.84 | — | Unverified |
| CIFAR-100 | ResNeXt-50(32x4d) + AutoMix | Percentage correct | 83.64 | — | Unverified |
| CIFAR-100 | WRN-28-8 +AutoMix | Percentage correct | 85.16 | — | Unverified |
| ImageNet | ResNet-18 (AutoMix) | Top 1 Accuracy | 72.05 | — | Unverified |
| ImageNet | ResNet-101 (AutoMix) | Top 1 Accuracy | 80.98 | — | Unverified |
| ImageNet | ResNet-50 (AutoMix) | Top 1 Accuracy | 79.25 | — | Unverified |
| ImageNet | ResNet-34 (AutoMix) | Top 1 Accuracy | 76.1 | — | Unverified |
| iNaturalist 2018 | ResNet-50 (AutoMix) | Top-1 Accuracy | 64.73 | — | Unverified |
| iNaturalist 2018 | ResNeXt-101 (AutoMix) | Top-1 Accuracy | 70.49 | — | Unverified |
| Places205 | AutoMix (ResNet-50 Supervised) | Top 1 Accuracy | 64.1 | — | Unverified |
| Tiny ImageNet Classification | ResNeXt-50 (AutoMix) | Validation Acc | 70.72 | — | Unverified |
| Tiny ImageNet Classification | ResNet18 (AutoMix) | Validation Acc | 67.33 | — | Unverified |