SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

(Image credit: Playing Atari with Deep Reinforcement Learning)

Papers

Showing 201–250 of 625 papers

| Title | Status | Hype |
|---|---|---|
| A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning | | 0 |
| Evolutionary Self-Replication as a Mechanism for Producing Artificial Intelligence | Code | 1 |
| WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving | | 0 |
| An Approach to Partial Observability in Games: Learning to Both Act and Observe | | 0 |
| High Performance Across Two Atari Paddle Games Using the Same Perceptual Control Architecture Without Training | | 0 |
| Cautious Policy Programming: Exploiting KL Regularization in Monotonic Policy Improvement for Reinforcement Learning | | 0 |
| Estimates for the Branching Factors of Atari Games | Code | 0 |
| AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning | Code | 1 |
| Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning | Code | 0 |
| Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction | Code | 0 |
| Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark | | 0 |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Code | 1 |
| Emphatic Algorithms for Deep Reinforcement Learning | | 0 |
| Non-Robust Feature Mapping in Deep Reinforcement Learning | | 0 |
| CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing | Code | 0 |
| Real-time Adversarial Perturbations against Deep Reinforcement Learning Policies: Attacks and Defenses | Code | 0 |
| Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity | | 0 |
| A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Code | 1 |
| GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning | | 0 |
| Going Beyond Linear Transformers with Recurrent Fast Weight Programmers | Code | 1 |
| Pretraining Representations for Data-Efficient Reinforcement Learning | Code | 1 |
| Towards Practical Credit Assignment for Deep Reinforcement Learning | | 0 |
| MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning | Code | 1 |
| MICo: Improved representations via sampling-based state similarity for Markov decision processes | Code | 0 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Code | 1 |
| An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning | | 0 |
| Provable Representation Learning for Imitation with Contrastive Fourier Features | Code | 0 |
| GMAC: A Distributional Perspective on Actor-Critic Framework | | 0 |
| Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization | | 0 |
| Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning | | 0 |
| Return-based Scaling: Yet Another Normalisation Trick for Deep RL | | 0 |
| CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration | | 0 |
| InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem | | 0 |
| BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning | | 0 |
| Adapting to Reward Progressivity via Spectral Reinforcement Learning | Code | 0 |
| Action Advising with Advice Imitation in Deep Reinforcement Learning | Code | 0 |
| Learning on a Budget via Teacher Imitation | Code | 0 |
| Online and Offline Reinforcement Learning by Planning with a Learned Model | Code | 1 |
| Muesli: Combining Improvements in Policy Optimization | Code | 1 |
| A coevolutionary approach to deep multi-agent reinforcement learning | Code | 1 |
| Bayesian Distributional Policy Gradients | | 0 |
| Generalizable Episodic Memory for Deep Reinforcement Learning | Code | 1 |
| Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning | | 0 |
| Behavior From the Void: Unsupervised Active Pre-Training | Code | 1 |
| Conservative Optimistic Policy Optimization via Multiple Importance Sampling | Code | 0 |
| Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings | Code | 1 |
| Addressing Action Oscillations through Learning Policy Inertia | | 0 |
| Ensemble Bootstrapping for Q-Learning | | 0 |
| Combining Off and On-Policy Training in Model-Based Reinforcement Learning | | 0 |
| Return-Based Contrastive Representation Learning for Reinforcement Learning | | 0 |

Benchmark Results

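| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GDI-I3 (200M frames) | Score | 864 | | Unverified |
| 2 | GDI-H3 (200M frames) | Score | 864 | | Unverified |
| 3 | GDI-H3 | Score | 864 | | Unverified |
| 4 | GDI-I3 | Score | 864 | | Unverified |
| 5 | Bootstrapped DQN | Score | 855 | | Unverified |
| 6 | FQF | Score | 854.2 | | Unverified |
| 7 | R2D2 | Score | 837.7 | | Unverified |
| 8 | Ape-X | Score | 800.9 | | Unverified |
| 9 | Agent57 | Score | 790.4 | | Unverified |
| 10 | IMPALA (deep) | Score | 787.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IQN | Score | 34 | | Unverified |
| 2 | QR-DQN-1 | Score | 34 | | Unverified |
| 3 | TRPO-hash | Score | 34 | | Unverified |
| 4 | NoisyNet-Dueling | Score | 34 | | Unverified |
| 5 | GDI-H3 | Score | 34 | | Unverified |
| 6 | GDI-H3 (200M frames) | Score | 34 | | Unverified |
| 7 | GDI-I3 | Score | 34 | | Unverified |
| 8 | Go-Explore | Score | 34 | | Unverified |
| 9 | Bootstrapped DQN | Score | 33.9 | | Unverified |
| 10 | C51 noop | Score | 33.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Agent57 | Score | 580,328.14 | | Unverified |
| 2 | QR-DQN-1 | Score | 572,510 | | Unverified |
| 3 | R2D2 | Score | 408,850 | | Unverified |
| 4 | IMPALA (deep) | Score | 351,200.12 | | Unverified |
| 5 | Ape-X | Score | 302,391.3 | | Unverified |
| 6 | A2C + SIL | Score | 104,975.6 | | Unverified |
| 7 | MuZero (Res2 Adam) | Score | 94,906.25 | | Unverified |
| 8 | DreamerV2 | Score | 94,688 | | Unverified |
| 9 | MuZero | Score | 72,276 | | Unverified |
| 10 | DNA | Score | 52,398 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GDI-H3 | Score | 1,000,000 | | Unverified |
| 2 | GDI-H3 (200M frames) | Score | 1,000,000 | | Unverified |
| 3 | Agent57 | Score | 999,997.63 | | Unverified |
| 4 | R2D2 | Score | 999,996.7 | | Unverified |
| 5 | MuZero | Score | 999,976.52 | | Unverified |
| 6 | MuZero (Res2 Adam) | Score | 999,659.18 | | Unverified |
| 7 | GDI-I3 | Score | 943,910 | | Unverified |
| 8 | Ape-X | Score | 392,952.3 | | Unverified |
| 9 | C51 noop | Score | 266,434 | | Unverified |
| 10 | Duel noop | Score | 50,254.2 | | Unverified |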