Model-based Reinforcement Learning for Parameterized Action Spaces

2024-04-03Code Available1· sign in to hype

Renhao Zhang, Haotian Fu, Yilin Miao, George Konidaris

Code Available — Be the first to reproduce this paper.

Code

github.com/valarzz/model-based-reinforcement-learning-for-parameterized-action-spaces
OfficialIn paperpytorch★ 28
github.com/valarzz/dlpa
OfficialIn paperpytorch★ 28

Abstract

We propose a novel model-based reinforcement learning algorithm -- Dynamics Learning and predictive control with Parameterized Actions (DLPA) -- for Parameterized Action Markov Decision Processes (PAMDPs). The agent learns a parameterized-action-conditioned dynamics model and plans with a modified Model Predictive Path Integral control. We theoretically quantify the difference between the generated trajectory and the optimal trajectory during planning in terms of the value they achieved through the lens of Lipschitz Continuity. Our empirical results on several standard benchmarks show that our algorithm achieves superior sample efficiency and asymptotic performance than state-of-the-art PAMDP methods.

Tasks

model Model-based Reinforcement Learning reinforcement-learning Reinforcement Learning

Model-based Reinforcement Learning for Parameterized Action Spaces

Code

Abstract

Tasks

Reproductions