AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

2021-03-24Code Available1· sign in to hype

Zicheng Liu, Siyuan Li, Di wu, Zihan Liu, ZhiYuan Chen, Lirong Wu, Stan Z. Li

Code Available — Be the first to reproduce this paper.

Code

github.com/zeyuanyin/tiny-imagenet
pytorch★ 20
github.com/Westlake-AI/AutoMix
pytorch★ 18

Abstract

Data mixing augmentation have proved to be effective in improving the generalization ability of deep neural networks. While early methods mix samples by hand-crafted policies (e.g., linear interpolation), recent methods utilize saliency information to match the mixed samples and labels via complex offline optimization. However, there arises a trade-off between precise mixing policies and optimization complexity. To address this challenge, we propose a novel automatic mixup (AutoMix) framework, where the mixup policy is parameterized and serves the ultimate classification goal directly. Specifically, AutoMix reformulates the mixup classification into two sub-tasks (i.e., mixed sample generation and mixup classification) with corresponding sub-networks and solves them in a bi-level optimization framework. For the generation, a learnable lightweight mixup generator, Mix Block, is designed to generate mixed samples by modeling patch-wise relationships under the direct supervision of the corresponding mixed labels. To prevent the degradation and instability of bi-level optimization, we further introduce a momentum pipeline to train AutoMix in an end-to-end manner. Extensive experiments on nine image benchmarks prove the superiority of AutoMix compared with state-of-the-art in various classification scenarios and downstream tasks.

Tasks

Classification Data Augmentation General Classification Image Classification Representation Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CIFAR-10	ResNeXt-50 (AutoMix)	Percentage correct	97.84	—	Unverified
CIFAR-100	ResNeXt-50(32x4d) + AutoMix	Percentage correct	83.64	—	Unverified
CIFAR-100	WRN-28-8 +AutoMix	Percentage correct	85.16	—	Unverified
ImageNet	ResNet-18 (AutoMix)	Top 1 Accuracy	72.05	—	Unverified
ImageNet	ResNet-101 (AutoMix)	Top 1 Accuracy	80.98	—	Unverified
ImageNet	ResNet-50 (AutoMix)	Top 1 Accuracy	79.25	—	Unverified
ImageNet	ResNet-34 (AutoMix)	Top 1 Accuracy	76.1	—	Unverified
iNaturalist 2018	ResNet-50 (AutoMix)	Top-1 Accuracy	64.73	—	Unverified
iNaturalist 2018	ResNeXt-101 (AutoMix)	Top-1 Accuracy	70.49	—	Unverified
Places205	AutoMix (ResNet-50 Supervised)	Top 1 Accuracy	64.1	—	Unverified
Tiny ImageNet Classification	ResNeXt-50 (AutoMix)	Validation Acc	70.72	—	Unverified
Tiny ImageNet Classification	ResNet18 (AutoMix)	Validation Acc	67.33	—	Unverified

AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

Code

Abstract

Tasks

Benchmark Results

Reproductions