PPO Dash: Improving Generalization in Deep Reinforcement Learning

2019-07-15Code Available0· sign in to hype

Joe Booth

Code Available — Be the first to reproduce this paper.

Code

github.com/Sohojoe/ppo-dash
OfficialIn paperpytorch★ 0

Abstract

Deep reinforcement learning is prone to overfitting, and traditional benchmarks such as Atari 2600 benchmark can exacerbate this problem. The Obstacle Tower Challenge addresses this by using randomized environments and separate seeds for training, validation, and test runs. This paper examines various improvements and best practices to the PPO algorithm using the Obstacle Tower Challenge to empirically study their impact with regards to generalization. Our experiments show that the combination provides state-of-the-art performance on the Obstacle Tower Challenge.

Tasks

Deep Reinforcement Learning reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

PPO Dash: Improving Generalization in Deep Reinforcement Learning

Code

Abstract

Tasks

Reproductions