Rainbow: Combining Improvements in Deep Reinforcement Learning

2017-10-06Code Available3· sign in to hype

Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver

arXiv PDF

Code Available — Be the first to reproduce this paper.

Reproduce

Code

github.com/thu-ml/tianshou
pytorch★ 10,409
github.com/floringogianu/atari-agents
pytorch★ 99
github.com/deconlabs/Binanace-trading-simulation
pytorch★ 33
github.com/jacobkooi/hadamax
jax★ 8
github.com/xusophia/DataSciFinalProj
pytorch★ 4
github.com/mohith-sakthivel/rainbow_dqn
pytorch★ 0
github.com/BY571/DQN-Atari-Agents
pytorch★ 0
github.com/liuyuezhang/pyrl
pytorch★ 0
github.com/robintyh1/icml2021-pengqlambda
tf★ 0
github.com/eddynelson/dqn
tf★ 0

Abstract

The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and can be fruitfully combined. This paper examines six extensions to the DQN algorithm and empirically studies their combination. Our experiments show that the combination provides state-of-the-art performance on the Atari 2600 benchmark, both in terms of data efficiency and final performance. We also provide results from a detailed ablation study that shows the contribution of each component to overall performance.

Tasks

Atari Games Deep Reinforcement Learning Montezuma's Revenge reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Atari 2600 Ms. Pacman	Rainbow	Score	2,570.2	—	Unverified
Atari 2600 Space Invaders	Rainbow	Score	12,629	—	Unverified
Atari-57	Rainbow DQN	Mean Human Normalized Score	873.97	—	Unverified
atari game	Rainbow	Human World Record Breakthrough	4	—	Unverified
Atari games	Rainbow DQN	Mean Human Normalized Score	873.97	—	Unverified

Rainbow: Combining Improvements in Deep Reinforcement Learning

Code

Abstract

Tasks

Benchmark Results

Reproductions