Distilling Virtual Examples for Long-tailed Recognition
Yin-Yin He, Jianxin Wu, Xiu-Shen Wei
- Code: github.com/yangyucheng000/DiVEmindspore
Abstract
We tackle the long-tailed visual recognition problem from the knowledge distillation perspective by proposing a Distill the Virtual Examples (DiVE) method. Specifically, by treating the predictions of a teacher model as virtual examples, we prove that distilling from these virtual examples is equivalent to label distribution learning under certain constraints. We show that when the virtual example distribution becomes flatter than the original input distribution, the under-represented tail classes receive significant improvements, which is crucial in long-tailed recognition. The proposed DiVE method can explicitly tune the virtual example distribution to become flat. Extensive experiments on three benchmark datasets, including the large-scale iNaturalist datasets, show that the proposed DiVE method significantly outperforms state-of-the-art methods. Furthermore, additional analyses and experiments verify the virtual example interpretation, and demonstrate the effectiveness of the designs in DiVE tailored to long-tailed problems.
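To make the abstract's core mechanism concrete, below is a minimal PyTorch sketch of distilling from flattened teacher predictions ("virtual examples"). It assumes temperature-based flattening and an alpha-weighted combination of the hard-label and distillation terms; the function name `dive_style_loss` and the hyperparameters `temperature` and `alpha` are illustrative assumptions, and the paper's full method includes components (e.g., the choice of teacher and how the flattening is tuned) not shown here.

```python
# Sketch of distilling flattened "virtual examples" (assumptions: pure
# temperature-based flattening and a simple alpha-weighted loss; the
# paper's exact formulation may differ).
import torch
import torch.nn.functional as F

def dive_style_loss(student_logits, teacher_logits, targets,
                    temperature=3.0, alpha=0.5):
    """KD loss in which the teacher distribution is flattened.

    A higher `temperature` makes each teacher prediction flatter,
    shifting probability mass toward under-represented tail classes.
    """
    # Hard-label term on the ground-truth targets.
    ce = F.cross_entropy(student_logits, targets)

    # Flattened teacher distribution: the "virtual examples".
    teacher_probs = F.softmax(teacher_logits / temperature, dim=1)

    # KL divergence from flattened teacher to student; the T^2 factor
    # keeps gradient magnitudes comparable across temperatures.
    kd = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=1),
        teacher_probs,
        reduction="batchmean",
    ) * temperature ** 2

    return (1.0 - alpha) * ce + alpha * kd
```

In this sketch, raising `temperature` is the knob that flattens the virtual example distribution, which, per the abstract, is what channels improvement to the tail classes.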
Tasks
- Long-tailed visual recognition
- Knowledge distillation
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| ImageNet-LT | RIDE-DiVE | Top-1 Accuracy (%) | 57.12 | — | Unverified |
| ImageNet-LT | DiVE | Top-1 Accuracy (%) | 53.10 | — | Unverified |
| iNaturalist 2018 | RIDE-DiVE | Top-1 Accuracy (%) | 73.44 | — | Unverified |
| iNaturalist 2018 | DiVE | Top-1 Accuracy (%) | 71.71 | — | Unverified |