SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 551600 of 625 papers

TitleStatusHype
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning0
Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks0
Provable Representation Learning for Imitation with Contrastive Fourier Features0
Reinforcement Learning and its Connections with Neuroscience and Psychology0
Deep Reinforcement Learning via L-BFGS Optimization0
Reward Prediction Error as an Exploration Objective in Deep RL0
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals0
Regulatory Focus: Promotion and Prevention Inclinations in Policy Search0
Reinforcement Learning and Video Games0
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards0
Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations0
Reinforcement Learning with Attention that Works: A Self-Supervised Approach0
Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions0
Return-Based Contrastive Representation Learning for Reinforcement Learning0
Return-based Scaling: Yet Another Normalisation Trick for Deep RL0
Reuse of Neural Modules for General Video Game Playing0
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning0
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks0
Sample Efficient Deep Neuroevolution in Low Dimensional Latent Space0
Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors0
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity0
Selective Eye-gaze Augmentation To Enhance Imitation Learning In Atari Games0
Self-Imitation Advantage Learning0
Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning0
Self-Supervised Structured Representations for Deep Reinforcement Learning0
A Self-Tuning Actor-Critic Algorithm0
Shallow Updates for Deep Reinforcement Learning0
Shared Learning : Enhancing Reinforcement in Q-Ensembles0
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Sigmoid-Weighted Linear Units for Neural Network Function Approximation in Reinforcement Learning0
Simultaneously Updating All Persistence Values in Reinforcement Learning0
Sketch-Based Linear Value Function Approximation0
Slot Contrastive Networks: A Contrastive Approach for Representing Objects0
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL0
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization0
Speeding up reinforcement learning by combining attention and agency features0
State Distribution-aware Sampling for Deep Q-learning0
Stop Regressing: Training Value Functions via Classification for Scalable Deep RL0
Strategic Attentive Writer for Learning Macro-Actions0
Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
Successor-Predecessor Intrinsic Exploration0
SwitchMT: An Adaptive Context Switching Methodology for Scalable Multi-Task Learning in Intelligent Autonomous Agents0
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents0
CopyCAT: Taking Control of Neural Policies with Constant Attacks0
Target Entropy Annealing for Discrete Soft Actor-Critic0
Target Return Optimizer for Multi-Game Decision Transformer0
Temporal-adaptive Hierarchical Reinforcement Learning0
Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning0
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces0
Show:102550
← PrevPage 12 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score864Unverified
2GDI-H3Score864Unverified
3GDI-I3(200M frames)Score864Unverified
4GDI-I3Score864Unverified
5Bootstrapped DQNScore855Unverified
6FQFScore854.2Unverified
7R2D2Score837.7Unverified
8Ape-XScore800.9Unverified
9Agent57Score790.4Unverified
10IMPALA (deep)Score787.34Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-I3Score34Unverified
2NoisyNet-DuelingScore34Unverified
3GDI-H3Score34Unverified
4TRPO-hashScore34Unverified
5IQNScore34Unverified
6QR-DQN-1Score34Unverified
7GDI-H3(200M frames)Score34Unverified
8Go-ExploreScore34Unverified
9ASL DDQNScore33.9Unverified
10C51 noopScore33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Agent57Score580,328.14Unverified
2QR-DQN-1Score572,510Unverified
3R2D2Score408,850Unverified
4IMPALA (deep)Score351,200.12Unverified
5Ape-XScore302,391.3Unverified
6A2C + SILScore104,975.6Unverified
7MuZero (Res2 Adam)Score94,906.25Unverified
8DreamerV2Score94,688Unverified
9MuZeroScore72,276Unverified
10DNAScore52,398Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score1,000,000Unverified
2GDI-H3Score1,000,000Unverified
3Agent57Score999,997.63Unverified
4R2D2Score999,996.7Unverified
5MuZeroScore999,976.52Unverified
6MuZero (Res2 Adam)Score999,659.18Unverified
7GDI-I3Score943,910Unverified
8Ape-XScore392,952.3Unverified
9C51 noopScore266,434Unverified
10Duel noopScore50,254.2Unverified