Playing Atari with Deep Reinforcement Learning

2013-12-19Code Available1· sign in to hype

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller

Code Available — Be the first to reproduce this paper.

Code

github.com/labmlai/annotated_deep_learning_paper_implementations
pytorch★ 66,103
github.com/DLR-RM/stable-baselines3
pytorch★ 12,962
github.com/toni-sm/skrl
jax★ 1,014
github.com/proroklab/popgym
pytorch★ 213
github.com/michaelnny/deep_rl_zoo
pytorch★ 122
github.com/sourenaKhanzadeh/snakeAi
pytorch★ 18
github.com/OscarHuangWind/Preference-Guided-DQN-Atari
pytorch★ 12
github.com/rishavb123/MineRL
tf★ 7
github.com/Anshu1245/RL-CourseProject
none★ 0
github.com/marload/deep-rl-tf2
tf★ 0

Abstract

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

Tasks

Atari Games Deep Reinforcement Learning Multi-Goal Reinforcement Learning Q-Learning Reinforcement Learning Reinforcement Learning (RL)

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Atari 2600 Beam Rider	DQN Best	Score	5,184	—	Unverified
Atari 2600 Breakout	DQN Best	Score	225	—	Unverified
Atari 2600 Enduro	DQN Best	Score	661	—	Unverified
Atari 2600 Pong	DQN Best	Score	21	—	Unverified
Atari 2600 Q*Bert	DQN Best	Score	4,500	—	Unverified
Atari 2600 Seaquest	DQN Best	Score	1,740	—	Unverified
Atari 2600 Space Invaders	DQN Best	Score	1,075	—	Unverified

Playing Atari with Deep Reinforcement Learning

Code

Abstract

Tasks

Benchmark Results

Reproductions