Understanding Self-Training for Gradual Domain Adaptation

2020-02-26ICML 2020Code Available1· sign in to hype

Ananya Kumar, Tengyu Ma, Percy Liang

Code Available — Be the first to reproduce this paper.

Code

github.com/p-lambda/gradual_domain_adaptation
OfficialIn papertf★ 37
worksheets.codalab.org/worksheets/0xabfe206d857d40a5baf2a19ece5f2659
Officialnone★ 0

Abstract

Machine learning systems must adapt to data distributions that evolve over time, in applications ranging from sensor networks and self-driving car perception modules to brain-machine interfaces. We consider gradual domain adaptation, where the goal is to adapt an initial classifier trained on a source domain given only unlabeled data that shifts gradually in distribution towards a target domain. We prove the first non-vacuous upper bound on the error of self-training with gradual shifts, under settings where directly adapting to the target domain can result in unbounded error. The theoretical analysis leads to algorithmic insights, highlighting that regularization and label sharpening are essential even when we have infinite data, and suggesting that self-training works particularly well for shifts with small Wasserstein-infinity distance. Leveraging the gradual shift structure leads to higher accuracies on a rotating MNIST dataset and a realistic Portraits dataset.

Tasks

Domain Adaptation Unsupervised Domain Adaptation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Portraits (over time)	Gradual Self-Training (Small Conv)	Accuracy (%)	83.8	—	Unverified

Understanding Self-Training for Gradual Domain Adaptation

Code

Abstract

Tasks

Benchmark Results

Reproductions