SOTAVerified

A spectral least-squares-type method for heavy-tailed corrupted regression with unknown covariance \& heterogeneous noise

2022-09-06Unverified0· sign in to hype

Roberto I. Oliveira, Zoraida F. Rico, Philip Thompson

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We revisit heavy-tailed corrupted least-squares linear regression assuming to have a corrupted n-sized label-feature sample of at most n arbitrary outliers. We wish to estimate a p-dimensional parameter b^* given such sample of a label-feature pair (y,x) satisfying y= x,b^*+ with heavy-tailed (x,). We only assume x is L^4-L^2 hypercontractive with constant L>0 and has covariance matrix with minimum eigenvalue 1/^2>0 and bounded condition number >0. The noise can be arbitrarily dependent on x and nonsymmetric as long as x has finite covariance matrix . We propose a near-optimal computationally tractable estimator, based on the power method, assuming no knowledge on (,) nor the operator norm of . With probability at least 1-, our proposed estimator attains the statistical rate ^2^1/2(pn+(1/)n+)^1/2 and breakdown-point 1L^4^2, both optimal in the _2-norm, assuming the near-optimal minimum sample size L^4^2(p p + (1/)) n, up to a log factor. To the best of our knowledge, this is the first computationally tractable algorithm satisfying simultaneously all the mentioned properties. Our estimator is based on a two-stage Multiplicative Weight Update algorithm. The first stage estimates a descent direction v with respect to the (unknown) pre-conditioned inner product (),. The second stage estimate the descent direction v with respect to the (known) inner product ,, without knowing nor estimating .

Tasks

Reproductions