Modular Deep Reinforcement Learning with Temporal Logic Specifications

2019-09-23Code Available0· sign in to hype

Lim Zun Yuan, Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

Code Available — Be the first to reproduce this paper.

Code

github.com/RickyMexx/DeepRL-LTLf
pytorch★ 0
github.com/RickyMexx/DeepRL-LTL
pytorch★ 0

Abstract

We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward is highly sparse but encompasses a high-level temporal structure. We represent this temporal structure by a finite-state machine and construct an on-the-fly synchronised product with the MDP and the finite machine. The temporal structure acts as a guide for the RL agent within the product, where a modular Deep Deterministic Policy Gradient (DDPG) architecture is proposed to generate a low-level control policy. We evaluate our framework in a Mars rover experiment and we present the success rate of the synthesised policy.

Tasks

Deep Reinforcement Learning reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Modular Deep Reinforcement Learning with Temporal Logic Specifications

Code

Abstract

Tasks

Reproductions