Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger

2017-12-23Code Available1· sign in to hype

Vahid Behzadan, Arslan Munir

Code Available — Be the first to reproduce this paper.

Code

github.com/behzadanksu/rl-attack
OfficialIn papertf★ 0
github.com/chenhongge/SA_DQN
pytorch★ 35
github.com/elytopia/Info-Sec-Hammerer-and-Brunori
tf★ 0
github.com/behzadanksu/rlattack-dev
tf★ 0

Abstract

Recent developments have established the vulnerability of deep Reinforcement Learning (RL) to policy manipulation attacks via adversarial perturbations. In this paper, we investigate the robustness and resilience of deep RL to training-time and test-time attacks. Through experimental results, we demonstrate that under noncontiguous training-time attacks, Deep Q-Network (DQN) agents can recover and adapt to the adversarial conditions by reactively adjusting the policy. Our results also show that policies learned under adversarial perturbations are more robust to test-time attacks. Furthermore, we compare the performance of -greedy and parameter-space noise exploration methods in terms of robustness and resilience against adversarial perturbations.

Tasks

Deep Reinforcement Learning reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger

Code

Abstract

Tasks

Reproductions