Robust Training in High Dimensions via Block Coordinate Geometric Median Descent

2021-06-16

Anish Acharya, Abolfazl Hashemi, Prateek Jain, Sujay Sanghavi, Inderjit S. Dhillon, Ufuk Topcu


Code

github.com/anishacharya/Optimization-Mavericks
Official · In paper · PyTorch · ★ 1
github.com/anishacharya/BGMD
PyTorch · ★ 4

Abstract

Geometric median (Gm) is a classical method in statistics for obtaining a robust estimate of the uncorrupted data; under gross corruption, it achieves the optimal breakdown point of 0.5. However, its computational complexity makes it infeasible for robustifying stochastic gradient descent (SGD) in high-dimensional optimization problems. In this paper, we show that by applying Gm to only a judiciously chosen block of coordinates at a time and using a memory mechanism, one can retain the breakdown point of 0.5 for smooth non-convex problems, with non-asymptotic convergence rates comparable to those of SGD with Gm.
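To make the idea in the abstract concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It approximates the geometric median with Weiszfeld iterations (a standard solver swapped in for illustration) and then applies it to a single block of coordinates. Two details are assumptions, since the abstract does not state them: the block is taken to be the k coordinates with the largest squared magnitude summed over the batch, and the memory is taken to be a per-sample residual holding the coordinates that were dropped. The names geometric_median and block_gm_step are hypothetical.

```python
import math


def geometric_median(points, iters=200, eps=1e-9):
    """Approximate the geometric median of equal-length vectors (Weiszfeld iterations)."""
    dim = len(points[0])
    # Start from the coordinate-wise mean.
    z = [sum(p[j] for p in points) / len(points) for j in range(dim)]
    for _ in range(iters):
        # Weight each point by the inverse of its distance to the current estimate.
        weights = [1.0 / max(math.dist(p, z), eps) for p in points]
        total = sum(weights)
        new_z = [sum(w * p[j] for w, p in zip(weights, points)) / total
                 for j in range(dim)]
        if math.dist(new_z, z) < eps:
            return new_z
        z = new_z
    return z


def block_gm_step(grads, memory, k):
    """One hypothetical block-coordinate aggregation step.

    grads:  list of per-sample gradient vectors
    memory: list of per-sample residual vectors carried over from the last step
    k:      number of coordinates to keep
    """
    dim = len(grads[0])
    # Add the residual left over from the previous step (assumed memory mechanism).
    corrected = [[g[j] + m[j] for j in range(dim)] for g, m in zip(grads, memory)]
    # Assumed selection rule: keep the k coordinates with the largest energy.
    energy = [sum(c[j] ** 2 for c in corrected) for j in range(dim)]
    block = sorted(sorted(range(dim), key=lambda j: energy[j], reverse=True)[:k])
    # Run the geometric median only on the selected block.
    gm_block = geometric_median([[c[j] for j in block] for c in corrected])
    update = [0.0] * dim
    for j, value in zip(block, gm_block):
        update[j] = value
    # Store the dropped coordinates so they can contribute in a later step.
    new_memory = [[0.0 if j in block else c[j] for j in range(dim)]
                  for c in corrected]
    return update, new_memory


# Usage: four clean points on the unit square plus one gross outlier.
points = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [100.0, 100.0]]
gm = geometric_median(points)
mean = [sum(p[j] for p in points) / len(points) for j in range(2)]
```

In this toy example the mean is dragged to (20.4, 20.4) by the single outlier, while the geometric median stays inside the unit square. The block step runs the same solver on only k of the coordinates, so each Weiszfeld iteration touches k values per sample rather than all of them.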

Tasks

Image Classification
Vocal Bursts Intensity Prediction

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
MNIST	CNN-5 Layer	Accuracy	99.27	—	Unverified
