Mastering Atari with Discrete World Models
Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba
Code
- github.com/danijar/dreamerv2 (official, TensorFlow) ★ 1,015
- github.com/opendilab/DI-engine (PyTorch) ★ 3,606
- github.com/andrejorsula/drl_grasping (PyTorch) ★ 506
- github.com/RajGhugare19/dreamerv2 (PyTorch) ★ 274
- github.com/adityabingi/Dreamer (PyTorch) ★ 49
- github.com/iglu-contest/iglu-builder-baseline-dreamer (TensorFlow) ★ 6
- github.com/chandar-lab/LoCA2 (TensorFlow) ★ 3
- github.com/CVC-Lab/SAC-for-H-Bond-Learning (PyTorch) ★ 1
- github.com/LukeBolly/dreamerv2 (TensorFlow) ★ 0
Abstract
Intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate such generalization and allow learning behaviors from imagined outcomes to increase sample-efficiency. While learning world models from image inputs has recently become feasible for some tasks, modeling Atari games accurately enough to derive successful behaviors has remained an open challenge for many years. We introduce DreamerV2, a reinforcement learning agent that learns behaviors purely from predictions in the compact latent space of a powerful world model. The world model uses discrete representations and is trained separately from the policy. DreamerV2 constitutes the first agent that achieves human-level performance on the Atari benchmark of 55 tasks by learning behaviors inside a separately trained world model. With the same computational budget and wall-clock time, DreamerV2 reaches 200M frames and surpasses the final performance of the top single-GPU agents IQN and Rainbow. DreamerV2 is also applicable to tasks with continuous actions, where it learns an accurate world model of a complex humanoid robot and solves stand-up and walking from only pixel inputs.
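The abstract states that the world model uses discrete representations. As a rough illustration of what a discrete latent state can look like, the sketch below samples a vector of categorical variables and encodes each one as a one-hot row. This is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the sizes `NUM_VARIABLES` and `NUM_CLASSES`, and the random logits are hypothetical and are not taken from the text above.

```python
import math
import random


def softmax(logits):
    """Convert a list of logits into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_discrete_latent(logits, rng):
    """Sample one one-hot row per categorical variable.

    `logits` holds one row of class logits per variable. The result
    has the same shape, with exactly one 1.0 in every row.
    """
    latent = []
    for row in logits:
        probs = softmax(row)
        index = rng.choices(range(len(row)), weights=probs, k=1)[0]
        latent.append([1.0 if i == index else 0.0 for i in range(len(row))])
    return latent


# Hypothetical sizes, chosen only for illustration.
NUM_VARIABLES = 4
NUM_CLASSES = 6

rng = random.Random(0)
# Stand-in for logits that a learned model would normally predict.
logits = [[rng.gauss(0.0, 1.0) for _ in range(NUM_CLASSES)]
          for _ in range(NUM_VARIABLES)]

latent = sample_discrete_latent(logits, rng)
# Flatten the one-hot rows into a single vector for downstream use.
flat = [value for row in latent for value in row]
```

The flattened vector has `NUM_VARIABLES * NUM_CLASSES` entries, of which exactly `NUM_VARIABLES` are nonzero. Sampling is not differentiable, so training a model through this step needs a gradient estimator, which is out of scope for this sketch.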
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Atari 2600 Alien | DreamerV2 | Score | 3,967 | — | Unverified |
| Atari 2600 Amidar | DreamerV2 | Score | 2,577 | — | Unverified |
| Atari 2600 Assault | DreamerV2 | Score | 23,625 | — | Unverified |
| Atari 2600 Asterix | DreamerV2 | Score | 72,311 | — | Unverified |
| Atari 2600 Asteroids | DreamerV2 | Score | 41,526 | — | Unverified |
| Atari 2600 Atlantis | DreamerV2 | Score | 978,778 | — | Unverified |
| Atari 2600 Bank Heist | DreamerV2 | Score | 1,126 | — | Unverified |
| Atari 2600 Battle Zone | DreamerV2 | Score | 40,325 | — | Unverified |
| Atari 2600 Beam Rider | DreamerV2 | Score | 18,646 | — | Unverified |
| Atari 2600 Berzerk | DreamerV2 | Score | 810 | — | Unverified |
| Atari 2600 Bowling | DreamerV2 | Score | 49 | — | Unverified |
| Atari 2600 Boxing | DreamerV2 | Score | 92 | — | Unverified |
| Atari 2600 Breakout | DreamerV2 | Score | 312 | — | Unverified |
| Atari 2600 Centipede | DreamerV2 | Score | 11,883 | — | Unverified |
| Atari 2600 Chopper Command | DreamerV2 | Score | 2,861 | — | Unverified |
| Atari 2600 Crazy Climber | DreamerV2 | Score | 161,839 | — | Unverified |
| Atari 2600 Demon Attack | DreamerV2 | Score | 82,263 | — | Unverified |
| Atari 2600 Double Dunk | DreamerV2 | Score | 17 | — | Unverified |
| Atari 2600 Enduro | DreamerV2 | Score | 1,656 | — | Unverified |
| Atari 2600 Fishing Derby | DreamerV2 | Score | 65 | — | Unverified |
| Atari 2600 Freeway | DreamerV2 | Score | 33 | — | Unverified |
| Atari 2600 Frostbite | DreamerV2 | Score | 11,384 | — | Unverified |
| Atari 2600 Gopher | DreamerV2 | Score | 92,282 | — | Unverified |
| Atari 2600 Gravitar | DreamerV2 | Score | 3,789 | — | Unverified |
| Atari 2600 HERO | DreamerV2 | Score | 21,868 | — | Unverified |
| Atari 2600 Ice Hockey | DreamerV2 | Score | 26 | — | Unverified |
| Atari 2600 James Bond | DreamerV2 | Score | 40,445 | — | Unverified |
| Atari 2600 Kangaroo | DreamerV2 | Score | 14,064 | — | Unverified |
| Atari 2600 Krull | DreamerV2 | Score | 50,061 | — | Unverified |
| Atari 2600 Kung-Fu Master | DreamerV2 | Score | 62,741 | — | Unverified |
| Atari 2600 Montezuma's Revenge | DreamerV2 | Score | 81 | — | Unverified |
| Atari 2600 Ms. Pacman | DreamerV2 | Score | 5,652 | — | Unverified |
| Atari 2600 Name This Game | DreamerV2 | Score | 14,649 | — | Unverified |
| Atari 2600 Phoenix | DreamerV2 | Score | 49,375 | — | Unverified |
| Atari 2600 Pitfall! | DreamerV2 | Score | 0 | — | Unverified |
| Atari 2600 Pong | DreamerV2 | Score | 20 | — | Unverified |
| Atari 2600 Private Eye | DreamerV2 | Score | 2,198 | — | Unverified |
| Atari 2600 Q*Bert | DreamerV2 | Score | 94,688 | — | Unverified |
| Atari 2600 River Raid | DreamerV2 | Score | 16,351 | — | Unverified |
| Atari 2600 Road Runner | DreamerV2 | Score | 203,576 | — | Unverified |
| Atari 2600 Robotank | DreamerV2 | Score | 78 | — | Unverified |
| Atari 2600 Seaquest | DreamerV2 | Score | 7,480 | — | Unverified |
| Atari 2600 Skiing | DreamerV2 | Score | -9,299 | — | Unverified |
| Atari 2600 Solaris | DreamerV2 | Score | 922 | — | Unverified |
| Atari 2600 Space Invaders | DreamerV2 | Score | 2,474 | — | Unverified |
| Atari 2600 Star Gunner | DreamerV2 | Score | 7,800 | — | Unverified |
| Atari 2600 Tennis | DreamerV2 | Score | 14 | — | Unverified |
| Atari 2600 Time Pilot | DreamerV2 | Score | 37,945 | — | Unverified |
| Atari 2600 Tutankham | DreamerV2 | Score | 264 | — | Unverified |
| Atari 2600 Up and Down | DreamerV2 | Score | 653,662 | — | Unverified |