SOTAVerified

Adaptive Online Estimation of Piecewise Polynomial Trends

2020-09-30NeurIPS 2020Unverified0· sign in to hype

Dheeraj Baby, Yu-Xiang Wang

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We consider the framework of non-stationary stochastic optimization [Besbes et al, 2015] with squared error losses and noisy gradient feedback where the dynamic regret of an online learner against a time varying comparator sequence is studied. Motivated from the theory of non-parametric regression, we introduce a new variational constraint that enforces the comparator sequence to belong to a discrete k^th order Total Variation ball of radius C_n. This variational constraint models comparators that have piece-wise polynomial structure which has many relevant practical applications [Tibshirani, 2014]. By establishing connections to the theory of wavelet based non-parametric regression, we design a polynomial time algorithm that achieves the nearly optimal dynamic regret of O(n^12k+3C_n^22k+3). The proposed policy is adaptive to the unknown radius C_n. Further, we show that the same policy is minimax optimal for several other non-parametric families of interest.

Tasks

Reproductions