SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 401450 of 625 papers

TitleStatusHype
Unlocking the Power of Representations in Long-term Novelty-based Exploration0
Unsupervised Active Pre-Training for Reinforcement Learning0
Using Generative Adversarial Nets on Atari Games for Feature Extraction in Deep Reinforcement Learning0
Continual Learning Using World Models for Pseudo-Rehearsal0
Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning0
Value-driven Hindsight Modelling0
Vanishing Bias Heuristic-guided Reinforcement Learning Algorithm0
Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning0
Visual Analogies between Atari Games for Studying Transfer Learning in RL0
Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games0
VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning0
Visual Rationalizations in Deep Reinforcement Learning for Atari Games0
WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving0
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?0
Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks0
Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark0
Virtual Replay CacheCode0
Unraveling the Rainbow: can value-based methods schedule?Code0
On Catastrophic Interference in Atari 2600 GamesCode0
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement LearningCode0
On Improving Deep Reinforcement Learning for POMDPsCode0
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement LearningCode0
On Learning Intrinsic Rewards for Policy Gradient MethodsCode0
Continual Learning In Environments With Polynomial Mixing TimesCode0
Soft Actor-Critic for Discrete Action SettingsCode0
Scalable Real-Time Recurrent Learning Using Columnar-Constructive NetworksCode0
On the Generalization of Representations in Reinforcement LearningCode0
Unsupervised Representation Learning in Partially Observable Atari GamesCode0
Solving Atari Games Using Fractals And EntropyCode0
Optimizing the Neural Architecture of Reinforcement Learning AgentsCode0
Conservative Optimistic Policy Optimization via Multiple Importance SamplingCode0
ConQUR: Mitigating Delusional Bias in Deep Q-learningCode0
Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy NetworksCode0
Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement LearningCode0
Perceptual Similarity for Measuring Decision-Making Style and Policy Diversity in GamesCode0
Performing Deep Recurrent Double Q-Learning for Atari GamesCode0
Exploration via Flow-Based Intrinsic RewardsCode0
Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement LearningCode0
Exploring Unknown States with Action BalanceCode0
ExPoSe: Combining State-Based Exploration with Gradient-Based Online SearchCode0
Action Advising with Advice Imitation in Deep Reinforcement LearningCode0
Self-supervised network distillation: an effective approach to exploration in sparse reward environmentsCode0
Fast deep reinforcement learning using online adjustments from the pastCode0
Conjugated Discrete Distributions for Distributional Reinforcement LearningCode0
An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning AgentsCode0
FDQN: A Flexible Deep Q-Network Framework for Game AutomationCode0
Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature AttributionCode0
Formal Language Constraints for Markov Decision ProcessesCode0
Fractal AI: A fragile theory of intelligenceCode0
Unsupervised Task Clustering for Multi-Task Reinforcement LearningCode0
Show:102550
← PrevPage 9 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score864Unverified
2GDI-H3Score864Unverified
3GDI-I3(200M frames)Score864Unverified
4GDI-I3Score864Unverified
5Bootstrapped DQNScore855Unverified
6FQFScore854.2Unverified
7R2D2Score837.7Unverified
8Ape-XScore800.9Unverified
9Agent57Score790.4Unverified
10IMPALA (deep)Score787.34Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-I3Score34Unverified
2NoisyNet-DuelingScore34Unverified
3GDI-H3Score34Unverified
4TRPO-hashScore34Unverified
5IQNScore34Unverified
6QR-DQN-1Score34Unverified
7GDI-H3(200M frames)Score34Unverified
8Go-ExploreScore34Unverified
9ASL DDQNScore33.9Unverified
10C51 noopScore33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Agent57Score580,328.14Unverified
2QR-DQN-1Score572,510Unverified
3R2D2Score408,850Unverified
4IMPALA (deep)Score351,200.12Unverified
5Ape-XScore302,391.3Unverified
6A2C + SILScore104,975.6Unverified
7MuZero (Res2 Adam)Score94,906.25Unverified
8DreamerV2Score94,688Unverified
9MuZeroScore72,276Unverified
10DNAScore52,398Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-H3(200M frames)Score1,000,000Unverified
2GDI-H3Score1,000,000Unverified
3Agent57Score999,997.63Unverified
4R2D2Score999,996.7Unverified
5MuZeroScore999,976.52Unverified
6MuZero (Res2 Adam)Score999,659.18Unverified
7GDI-I3Score943,910Unverified
8Ape-XScore392,952.3Unverified
9C51 noopScore266,434Unverified
10Duel noopScore50,254.2Unverified