Pseudorehearsal in value function approximation

2017-03-21Unverified0· sign in to hype

Vladimir Marochko, Leonard Johard, Manuel Mazzara

Unverified — Be the first to reproduce this paper.

Abstract

Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation in a pole balancing task. We have found that pseudorehearsal seems to assist learning even in such very simple problems, given proper initialization of the rehearsal parameters.

Tasks

Q-Learning reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Pseudorehearsal in value function approximation

Abstract

Tasks

Reproductions