Model-Free Episodic Control with State Aggregation

2020-08-21Unverified0· sign in to hype

Rafael Pinto

Unverified — Be the first to reproduce this paper.

Abstract

Episodic control provides a highly sample-efficient method for reinforcement learning while enforcing high memory and computational requirements. This work proposes a simple heuristic for reducing these requirements, and an application to Model-Free Episodic Control (MFEC) is presented. Experiments on Atari games show that this heuristic successfully reduces MFEC computational demands while producing no significant loss of performance when conservative choices of hyperparameters are used. Consequently, episodic control becomes a more feasible option when dealing with reinforcement learning tasks.

Tasks

Atari Games model reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Atari 2600 Frostbite	MFEC	Score	2,394	—	Unverified
Atari 2600 HERO	MFEC	Score	11,732	—	Unverified
Atari 2600 Ms. Pacman	MFEC	Score	8,530.4	—	Unverified
Atari 2600 Q*Bert	MFEC	Score	14,135	—	Unverified
Atari 2600 River Raid	MFEC	Score	3,868	—	Unverified
Atari 2600 Space Invaders	MFEC	Score	1,990	—	Unverified

Model-Free Episodic Control with State Aggregation

Abstract

Tasks

Benchmark Results

Reproductions