SOTAVerified

Online Learning: Stochastic, Constrained, and Smoothed Adversaries

2011-12-01NeurIPS 2011Unverified0· sign in to hype

Alexander Rakhlin, Karthik Sridharan, Ambuj Tewari

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Learning theory has largely focused on two main learning scenarios: the classical statistical setting where instances are drawn i.i.d. from a fixed distribution, and the adversarial scenario whereby at every time step the worst instance is revealed to the player. It can be argued that in the real world neither of these assumptions is reasonable. We define the minimax value of a game where the adversary is restricted in his moves, capturing stochastic and non-stochastic assumptions on data. Building on the sequential symmetrization approach, we define a notion of distribution-dependent Rademacher complexity for the spectrum of problems ranging from i.i.d. to worst-case. The bounds let us immediately deduce variation-type bounds. We study a smoothed online learning scenario and show that exponentially small amount of noise can make function classes with infinite Littlestone dimension learnable.

Tasks

Reproductions