Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

2020-06-29ICLR 2021Code Available1· sign in to hype

Kaidi Cao, Yining Chen, Junwei Lu, Nikos Arechiga, Adrien Gaidon, Tengyu Ma

Code Available — Be the first to reproduce this paper.

Code

github.com/kaidic/HAR
Officialpytorch★ 42

Abstract

Real-world large-scale datasets are heteroskedastic and imbalanced -- labels have varying levels of uncertainty and label distributions are long-tailed. Heteroskedasticity and imbalance challenge deep learning algorithms due to the difficulty of distinguishing among mislabeled, ambiguous, and rare examples. Addressing heteroskedasticity and imbalance simultaneously is under-explored. We propose a data-dependent regularization technique for heteroskedastic datasets that regularizes different regions of the input space differently. Inspired by the theoretical derivation of the optimal regularization strength in a one-dimensional nonparametric classification setting, our approach adaptively regularizes the data points in higher-uncertainty, lower-density regions more heavily. We test our method on several benchmark tasks, including a real-world heteroskedastic and imbalanced dataset, WebVision. Our experiments corroborate our theory and demonstrate a significant improvement over other methods in noise-robust deep learning.

Tasks

Deep Learning Image Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
WebVision-1000	HAR (InceptionResNet-v2)	Top-1 Accuracy	75	—	Unverified

Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Code

Abstract

Tasks

Benchmark Results

Reproductions