SOTAVerified

Average Cost Optimal Control of Stochastic Systems Using Reinforcement Learning

2020-10-13Unverified0· sign in to hype

Jing Lai, Junlin Xiong

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper addresses the average cost minimization problem for discrete-time systems with multiplicative and additive noises via reinforcement learning. By using Q-function, we propose an online learning scheme to estimate the kernel matrix of Q-function and to update the control gain using the data along the system trajectories. The obtained control gain and kernel matrix are proved to converge to the optimal ones. To implement the proposed learning scheme, an online model-free reinforcement learning algorithm is given, where recursive least squares method is used to estimate the kernel matrix of Q-function. A numerical example is presented to illustrate the proposed approach.

Tasks

Reproductions