Open Problem: Model Selection for Contextual Bandits
2020-06-19Unverified0· sign in to hype
Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
In statistical learning, algorithms for model selection allow the learner to adapt to the complexity of the best hypothesis class in a sequence. We ask whether similar guarantees are possible for contextual bandit learning.