SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 351375 of 625 papers

TitleStatusHype
Return-Based Contrastive Representation Learning for Reinforcement Learning0
Return-based Scaling: Yet Another Normalisation Trick for Deep RL0
Reuse of Neural Modules for General Video Game Playing0
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning0
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks0
Sample Efficient Deep Neuroevolution in Low Dimensional Latent Space0
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors0
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity0
Selective Eye-gaze Augmentation To Enhance Imitation Learning In Atari Games0
Self-Imitation Advantage Learning0
Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning0
Self-Supervised Structured Representations for Deep Reinforcement Learning0
A Self-Tuning Actor-Critic Algorithm0
Shallow Updates for Deep Reinforcement Learning0
Shared Learning : Enhancing Reinforcement in Q-Ensembles0
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning0
Simultaneously Updating All Persistence Values in Reinforcement Learning0
Sketch-Based Linear Value Function Approximation0
Slot Contrastive Networks: A Contrastive Approach for Representing Objects0
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL0
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization0
Speeding up reinforcement learning by combining attention and agency features0
State Distribution-aware Sampling for Deep Q-learning0
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL0
Show:102550
← PrevPage 15 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score864Unverified
2GDI-H3Score864Unverified
3GDI-I3(200M frames)Score864Unverified
4GDI-I3Score864Unverified
5Bootstrapped DQNScore855Unverified
6FQFScore854.2Unverified
7R2D2Score837.7Unverified
8Ape-XScore800.9Unverified
9Agent57Score790.4Unverified
10IMPALA (deep)Score787.34Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-I3Score34Unverified
2NoisyNet-DuelingScore34Unverified
3GDI-H3Score34Unverified
4TRPO-hashScore34Unverified
5IQNScore34Unverified
6QR-DQN-1Score34Unverified
7GDI-H3(200M frames)Score34Unverified
8Go-ExploreScore34Unverified
9ASL DDQNScore33.9Unverified
10C51 noopScore33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Agent57Score580,328.14Unverified
2QR-DQN-1Score572,510Unverified
3R2D2Score408,850Unverified
4IMPALA (deep)Score351,200.12Unverified
5Ape-XScore302,391.3Unverified
6A2C + SILScore104,975.6Unverified
7MuZero (Res2 Adam)Score94,906.25Unverified
8DreamerV2Score94,688Unverified
9MuZeroScore72,276Unverified
10DNAScore52,398Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score1,000,000Unverified
2GDI-H3Score1,000,000Unverified
3Agent57Score999,997.63Unverified
4R2D2Score999,996.7Unverified
5MuZeroScore999,976.52Unverified
6MuZero (Res2 Adam)Score999,659.18Unverified
7GDI-I3Score943,910Unverified
8Ape-XScore392,952.3Unverified
9C51 noopScore266,434Unverified
10Duel noopScore50,254.2Unverified