
Reward Function and Initial Values: Better Choices for Accelerated Goal-Directed Reinforcement Learning

2016-09-01

Laetitia Matignon, guillaume.laurent, nadine.piat

Abstract

An important issue in Reinforcement Learning (RL) is to accelerate or improve the learning process. In this paper, we study the influence of some RL parameters over the learning speed. Indeed, although RL convergence properties have been widely studied, no precise rules exist to correctly choose the reward function and initial Q-values. Our method helps the choice of these RL parameters within the context of reaching a goal in a minimal time. We develop a theoretical study and also provide experimental justifications for choosing on the one hand the reward function, and on the other hand particular initial Q-values based on a goal bias function.
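To make the two parameters named in the abstract concrete, the following is a minimal sketch of tabular Q-learning on a toy 1-D corridor in which both the reward function and the initial Q-values are configurable. Everything in it is an assumption made for illustration: the corridor environment, the function name `q_learning_corridor`, the parameters `goal_reward`, `step_reward`, and `init_q`, and in particular the bias `gamma ** (goal - s)`, which is only one plausible distance-based initialization and is not taken from the paper.

```python
import random

def q_learning_corridor(n_states=10, episodes=50, alpha=0.5, gamma=0.9,
                        goal_reward=1.0, step_reward=0.0,
                        init_q=lambda s: 0.0, seed=0, max_steps=10_000):
    """Greedy tabular Q-learning on a 1-D corridor (hypothetical toy task).

    The agent starts in state 0 and the goal is the right-most state.
    Returns the Q-table and the number of steps taken in each episode.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    # q[s][a] with a = 0 (left) or 1 (right); both actions start at init_q(s).
    q = [[init_q(s), init_q(s)] for s in range(n_states)]
    steps_per_episode = []
    for _ in range(episodes):
        s, steps = 0, 0
        while s != goal and steps < max_steps:
            # Greedy action selection, breaking exact ties at random.
            best = max(q[s])
            a = rng.choice([i for i in (0, 1) if q[s][i] == best])
            s_next = max(0, s - 1) if a == 0 else s + 1
            reached_goal = s_next == goal
            r = goal_reward if reached_goal else step_reward
            # The goal is terminal, so no bootstrap term is added there.
            target = r if reached_goal else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            steps += 1
        steps_per_episode.append(steps)
    return q, steps_per_episode

N_STATES, GAMMA = 10, 0.9
GOAL = N_STATES - 1

# Baseline: all Q-values start at zero.
_, steps_zero = q_learning_corridor(n_states=N_STATES, gamma=GAMMA)

# Assumed goal-biased start: values grow as the state gets closer to the goal.
_, steps_biased = q_learning_corridor(
    n_states=N_STATES, gamma=GAMMA,
    init_q=lambda s: GAMMA ** (GOAL - s),
)
```

Comparing `steps_zero[0]` with `steps_biased[0]` gives a rough feel for how initialization changes the length of the first episode in this toy setting. Varying `goal_reward` and `step_reward` in the same way is one simple means of probing the effect of the reward function, though the result on this corridor says nothing about the paper's own experiments.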
