SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 151175 of 625 papers

TitleStatusHype
Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks0
Learning Relational Rules from RewardsCode0
NavDreams: Towards Camera-Only RL Navigation Among HumansCode1
Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act0
Improving the Diversity of Bootstrapped DQN by Replacing Priors With Noise0
On the Generalization of Representations in Reinforcement LearningCode0
A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search0
ExPoSe: Combining State-Based Exploration with Gradient-Based Online SearchCode0
Distributional Reinforcement Learning with Regularized Wasserstein LossCode0
Deep Reinforcement Learning with Spiking Q-learning0
Exploration by Random Network Distillation0
Attention Option-Critic0
Execute Order 66: Targeted Data Poisoning for Reinforcement Learning0
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement LearningCode0
Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs0
Conjugated Discrete Distributions for Distributional Reinforcement LearningCode0
Human-Level Control through Directly-Trained Deep Spiking Q-NetworksCode1
Continual Learning In Environments With Polynomial Mixing TimesCode0
A Benchmark for Low-Switching-Cost Reinforcement Learning0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization0
A Review for Deep Reinforcement Learning in Atari:Benchmarks, Challenges, and Solutions0
Hybrid Self-Attention NEAT: A novel evolutionary approach to improve the NEAT algorithmCode1
Virtual Replay CacheCode0
Target Entropy Annealing for Discrete Soft Actor-Critic0
NovelD: A Simple yet Effective Exploration CriterionCode1
Show:102550
← PrevPage 7 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score864Unverified
2GDI-H3Score864Unverified
3GDI-I3(200M frames)Score864Unverified
4GDI-I3Score864Unverified
5Bootstrapped DQNScore855Unverified
6FQFScore854.2Unverified
7R2D2Score837.7Unverified
8Ape-XScore800.9Unverified
9Agent57Score790.4Unverified
10IMPALA (deep)Score787.34Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-I3Score34Unverified
2NoisyNet-DuelingScore34Unverified
3GDI-H3Score34Unverified
4TRPO-hashScore34Unverified
5IQNScore34Unverified
6QR-DQN-1Score34Unverified
7GDI-H3(200M frames)Score34Unverified
8Go-ExploreScore34Unverified
9ASL DDQNScore33.9Unverified
10C51 noopScore33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Agent57Score580,328.14Unverified
2QR-DQN-1Score572,510Unverified
3R2D2Score408,850Unverified
4IMPALA (deep)Score351,200.12Unverified
5Ape-XScore302,391.3Unverified
6A2C + SILScore104,975.6Unverified
7MuZero (Res2 Adam)Score94,906.25Unverified
8DreamerV2Score94,688Unverified
9MuZeroScore72,276Unverified
10DNAScore52,398Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score1,000,000Unverified
2GDI-H3Score1,000,000Unverified
3Agent57Score999,997.63Unverified
4R2D2Score999,996.7Unverified
5MuZeroScore999,976.52Unverified
6MuZero (Res2 Adam)Score999,659.18Unverified
7GDI-I3Score943,910Unverified
8Ape-XScore392,952.3Unverified
9C51 noopScore266,434Unverified
10Duel noopScore50,254.2Unverified