Model-Free Episodic Control with State Aggregation
2020-08-21Unverified0· sign in to hype
Rafael Pinto
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
Episodic control provides a highly sample-efficient method for reinforcement learning while enforcing high memory and computational requirements. This work proposes a simple heuristic for reducing these requirements, and an application to Model-Free Episodic Control (MFEC) is presented. Experiments on Atari games show that this heuristic successfully reduces MFEC computational demands while producing no significant loss of performance when conservative choices of hyperparameters are used. Consequently, episodic control becomes a more feasible option when dealing with reinforcement learning tasks.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Atari 2600 Frostbite | MFEC | Score | 2,394 | — | Unverified |
| Atari 2600 HERO | MFEC | Score | 11,732 | — | Unverified |
| Atari 2600 Ms. Pacman | MFEC | Score | 8,530.4 | — | Unverified |
| Atari 2600 Q*Bert | MFEC | Score | 14,135 | — | Unverified |
| Atari 2600 River Raid | MFEC | Score | 3,868 | — | Unverified |
| Atari 2600 Space Invaders | MFEC | Score | 1,990 | — | Unverified |