Angrier Birds: Bayesian reinforcement learning
2016-01-06Code Available0· sign in to hype
Imanol Arrieta Ibarra, Bernardo Ramos, Lars Roemheld
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/imanolarrieta/angrybirdsOfficialIn papernone★ 0
Abstract
We train a reinforcement learner to play a simplified version of the game Angry Birds. The learner is provided with a game state in a manner similar to the output that could be produced by computer vision algorithms. We improve on the efficiency of regular -greedy Q-Learning with linear function approximation through more systematic exploration in Randomized Least Squares Value Iteration (RLSVI), an algorithm that samples its policy from a posterior distribution on optimal policies. With larger state-action spaces, efficient exploration becomes increasingly important, as evidenced by the faster learning in RLSVI.