SOTAVerified

Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics

2012-10-12Unverified0· sign in to hype

Sohrob Kazerounian, Matthew Luciw, Mathis Richter, Yulia Sandamirskaya

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA( ) for learning a behavioral sequence from delayed reward. DN-SARSA( ) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuroscience model of working memory, called Item and Order working memory, which serves as an eligibility trace. DN-SARSA( ) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA( ) performs on the level of the discrete SARSA( ), validating the feasibility of general reinforcement learning without compromising neural dynamics.

Tasks

Reproductions