Options Discovery with Budgeted Reinforcement Learning
2016-11-21Unverified0· sign in to hype
Aurélia Léon, Ludovic Denoyer
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We consider the problem of learning hierarchical policies for Reinforcement Learning able to discover options, an option corresponding to a sub-policy over a set of primitive actions. Different models have been proposed during the last decade that usually rely on a predefined set of options. We specifically address the problem of automatically discovering options in decision processes. We describe a new learning model called Budgeted Option Neural Network (BONN) able to discover options based on a budgeted learning objective. The BONN model is evaluated on different classical RL problems, demonstrating both quantitative and qualitative interesting results.