SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 351400 of 625 papers

TitleStatusHype
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors0
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity0
Selective Eye-gaze Augmentation To Enhance Imitation Learning In Atari Games0
Self-Imitation Advantage Learning0
Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning0
Self-Supervised Structured Representations for Deep Reinforcement Learning0
A Self-Tuning Actor-Critic Algorithm0
Shallow Updates for Deep Reinforcement Learning0
Shared Learning : Enhancing Reinforcement in Q-Ensembles0
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning0
Simultaneously Updating All Persistence Values in Reinforcement Learning0
Sketch-Based Linear Value Function Approximation0
Slot Contrastive Networks: A Contrastive Approach for Representing Objects0
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL0
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization0
Speeding up reinforcement learning by combining attention and agency features0
State Distribution-aware Sampling for Deep Q-learning0
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL0
Strategic Attentive Writer for Learning Macro-Actions0
Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
Successor-Predecessor Intrinsic Exploration0
SwitchMT: An Adaptive Context Switching Methodology for Scalable Multi-Task Learning in Intelligent Autonomous Agents0
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents0
CopyCAT: Taking Control of Neural Policies with Constant Attacks0
Target Entropy Annealing for Discrete Soft Actor-Critic0
Target Return Optimizer for Multi-Game Decision Transformer0
Temporal-adaptive Hierarchical Reinforcement Learning0
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning0
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces0
The Past and Present of Imitation Learning: A Citation Chain Study0
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning0
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions0
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning0
Towards automatic construction of multi-network models for heterogeneous multi-task learning0
Reconstructing Actions To Explain Deep Reinforcement Learning0
Towards Consistent Performance on Atari using Expert Demonstrations0
Towards continual learning in medical imaging0
Towards Control-Centric Representations in Reinforcement Learning from Images0
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations0
Towards Practical Credit Assignment for Deep Reinforcement Learning0
The Benefits of Being Categorical Distributional: Uncertainty-aware Regularized Exploration in Reinforcement Learning0
Towards Understanding Distributional Reinforcement Learning: Regularization, Optimization, Acceleration and Sinkhorn Algorithm0
Training with Worst-Case Distributional Shift causes Overestimation and Inaccuracies in State-Action Value Functions0
Transferring Deep Reinforcement Learning with Adversarial Objective and Augmentation0
Transforming Game Play: A Comparative Study of DCQN and DTQN Architectures in Reinforcement Learning0
Understanding and Diagnosing Deep Reinforcement Learning0
Understanding plasticity in neural networks0
Understanding Visual Concepts with Continuation Learning0
Show:102550
← PrevPage 8 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDI-I3(200M frames)Score864Unverified
2GDI-H3(200M frames)Score864Unverified
3GDI-H3Score864Unverified
4GDI-I3Score864Unverified
5Bootstrapped DQNScore855Unverified
6FQFScore854.2Unverified
7R2D2Score837.7Unverified
8Ape-XScore800.9Unverified
9Agent57Score790.4Unverified
10IMPALA (deep)Score787.34Unverified
#ModelMetricClaimedVerifiedStatus
1IQNScore34Unverified
2QR-DQN-1Score34Unverified
3TRPO-hashScore34Unverified
4NoisyNet-DuelingScore34Unverified
5GDI-H3Score34Unverified
6GDI-H3(200M frames)Score34Unverified
7GDI-I3Score34Unverified
8Go-ExploreScore34Unverified
9Bootstrapped DQNScore33.9Unverified
10C51 noopScore33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Agent57Score580,328.14Unverified
2QR-DQN-1Score572,510Unverified
3R2D2Score408,850Unverified
4IMPALA (deep)Score351,200.12Unverified
5Ape-XScore302,391.3Unverified
6A2C + SILScore104,975.6Unverified
7MuZero (Res2 Adam)Score94,906.25Unverified
8DreamerV2Score94,688Unverified
9MuZeroScore72,276Unverified
10DNAScore52,398Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-H3Score1,000,000Unverified
2GDI-H3(200M frames)Score1,000,000Unverified
3Agent57Score999,997.63Unverified
4R2D2Score999,996.7Unverified
5MuZeroScore999,976.52Unverified
6MuZero (Res2 Adam)Score999,659.18Unverified
7GDI-I3Score943,910Unverified
8Ape-XScore392,952.3Unverified
9C51 noopScore266,434Unverified
10Duel noopScore50,254.2Unverified