SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 326–350 of 625 papers

Title | Status | Hype
A Convergent Variant of the Boltzmann Softmax Operator in Reinforcement Learning | | 0
Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | | 0
Activation by Interval-wise Dropout: A Simple Way to Prevent Neural Networks from Plasticity Loss | | 0
Adapting Auxiliary Losses Using Gradient Similarity | | 0
Adapting Behaviour for Learning Progress | | 0
Adaptive N-step Bootstrapping with Off-policy Data | | 0
Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning | | 0
Addressing Action Oscillations through Learning Policy Inertia | | 0
A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning | | 0
A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games | | 0
Adventurer: Exploration with BiGAN for Deep Reinforcement Learning | | 0
Adversary A3C for Robust Reinforcement Learning | | 0
Agent-Aware Training for Agent-Agnostic Action Advising in Deep Reinforcement Learning | | 0
AGIL: Learning Attention from Human for Visuomotor Tasks | | 0
A Human Mixed Strategy Approach to Deep Reinforcement Learning | | 0
A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction | | 0
An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios | | 0
Analysing Results from AI Benchmarks: Key Indicators and How to Obtain Them | | 0
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent | | 0
An Approach to Partial Observability in Games: Learning to Both Act and Observe | | 0
An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning | | 0
A new Potential-Based Reward Shaping for Reinforcement Learning Agent | | 0
An initial attempt of combining visual selective attention with deep reinforcement learning | | 0
APF+: Boosting adaptive-potential function reinforcement learning methods with a W-shaped network for high-dimensional games | | 0
Approximate Shielding of Atari Agents for Safe Exploration | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GDI-H3 (200M frames) | Score | 864 | | Unverified
2 | GDI-I3 (200M frames) | Score | 864 | | Unverified
3 | GDI-I3 | Score | 864 | | Unverified
4 | GDI-H3 | Score | 864 | | Unverified
5 | Bootstrapped DQN | Score | 855 | | Unverified
6 | FQF | Score | 854.2 | | Unverified
7 | R2D2 | Score | 837.7 | | Unverified
8 | Ape-X | Score | 800.9 | | Unverified
9 | Agent57 | Score | 790.4 | | Unverified
10 | IMPALA (deep) | Score | 787.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GDI-H3 (200M frames) | Score | 34 | | Unverified
2 | TRPO-hash | Score | 34 | | Unverified
3 | IQN | Score | 34 | | Unverified
4 | NoisyNet-Dueling | Score | 34 | | Unverified
5 | QR-DQN-1 | Score | 34 | | Unverified
6 | Go-Explore | Score | 34 | | Unverified
7 | GDI-I3 | Score | 34 | | Unverified
8 | GDI-H3 | Score | 34 | | Unverified
9 | ASL DDQN | Score | 33.9 | | Unverified
10 | Bootstrapped DQN | Score | 33.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Agent57 | Score | 580,328.14 | | Unverified
2 | QR-DQN-1 | Score | 572,510 | | Unverified
3 | R2D2 | Score | 408,850 | | Unverified
4 | IMPALA (deep) | Score | 351,200.12 | | Unverified
5 | Ape-X | Score | 302,391.3 | | Unverified
6 | A2C + SIL | Score | 104,975.6 | | Unverified
7 | MuZero (Res2 Adam) | Score | 94,906.25 | | Unverified
8 | DreamerV2 | Score | 94,688 | | Unverified
9 | MuZero | Score | 72,276 | | Unverified
10 | DNA | Score | 52,398 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GDI-H3 (200M frames) | Score | 1,000,000 | | Unverified
2 | GDI-H3 | Score | 1,000,000 | | Unverified
3 | Agent57 | Score | 999,997.63 | | Unverified
4 | R2D2 | Score | 999,996.7 | | Unverified
5 | MuZero | Score | 999,976.52 | | Unverified
6 | MuZero (Res2 Adam) | Score | 999,659.18 | | Unverified
7 | GDI-I3 | Score | 943,910 | | Unverified
8 | Ape-X | Score | 392,952.3 | | Unverified
9 | C51 noop | Score | 266,434 | | Unverified
10 | Duel noop | Score | 50,254.2 | | Unverified