Open Problem: Model Selection for Contextual Bandits

2020-06-19

Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo

Abstract

In statistical learning, algorithms for model selection allow the learner to adapt to the complexity of the best hypothesis class in a sequence. We ask whether similar guarantees are possible for contextual bandit learning.

Tasks

Model Selection, Multi-Armed Bandits
