SOTAVerified

Atari Games

The Atari 2600 Games task (and dataset) involves training an agent to achieve high game scores.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Papers

Showing 150 of 625 papers

TitleStatusHype
CALE: Continuous Arcade Learning EnvironmentCode7
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale KnowledgeCode4
Streaming Deep Reinforcement Learning Finally WorksCode3
Distributed Prioritized Experience ReplayCode3
Rainbow: Combining Improvements in Deep Reinforcement LearningCode3
Conformal Symplectic Optimization for Stable Reinforcement LearningCode2
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation ModelsCode2
Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter AircraftsCode2
Mastering Atari Games with Limited DataCode2
Mastering Atari, Go, Chess and Shogi by Planning with a Learned ModelCode2
Accelerated Methods for Deep Reinforcement LearningCode2
Benchmarking Deep Reinforcement Learning for Continuous ControlCode2
OptionZero: Planning with Learned OptionsCode1
Uncovering Untapped Potential in Sample-Efficient World Model AgentsCode1
Divergence-Augmented Policy OptimizationCode1
Enhancing Chess Reinforcement Learning with Graph RepresentationCode1
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model PretrainingCode1
Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement LearningCode1
Interpretable and Editable Programmatic Tree Policies for Reinforcement LearningCode1
Active Reinforcement Learning for Robust Building ControlCode1
MiniZero: Comparative Analysis of AlphaZero and MuZero on Go, Othello, and Atari GamesCode1
PGDQN: Preference-Guided Deep Q-NetworkCode1
Beyond Surprise: Improving Exploration Through Surprise NoveltyCode1
Scaling Laws for Imitation Learning in Single-Agent GamesCode1
OCAtari: Object-Centric Atari 2600 Reinforcement Learning EnvironmentsCode1
Think Before You Act: Decision Transformers with Working MemoryCode1
Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized DiversityCode1
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert InvolvementCode1
Redeeming Intrinsic Rewards via Constrained OptimizationCode1
WILD-SCAV: Benchmarking FPS Gaming AI on Unity3D-based EnvironmentsCode1
Atari-5: Distilling the Arcade Learning Environment down to Five GamesCode1
Revisiting Discrete Soft Actor-CriticCode1
MAN: Multi-Action Networks LearningCode1
A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari GamesCode1
Sampling Efficient Deep Reinforcement Learning through Preference-Guided Stochastic ExplorationCode1
DNA: Proximal Policy Optimization with a Dual Network ArchitectureCode1
Deep Hierarchical Planning from PixelsCode1
Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate ProgressCode1
NavDreams: Towards Camera-Only RL Navigation Among HumansCode1
Human-Level Control through Directly-Trained Deep Spiking Q-NetworksCode1
Hybrid Self-Attention NEAT: A novel evolutionary approach to improve the NEAT algorithmCode1
NovelD: A Simple yet Effective Exploration CriterionCode1
Fast and Data-Efficient Training of Rainbow: an Experimental Study on AtariCode1
Large Batch Experience ReplayCode1
Evolutionary Self-Replication as a Mechanism for Producing Artificial IntelligenceCode1
AdaRL: What, Where, and How to Adapt in Transfer Reinforcement LearningCode1
IQ-Learn: Inverse soft-Q Learning for ImitationCode1
A Game-Theoretic Approach to Multi-Agent Trust Region OptimizationCode1
Going Beyond Linear Transformers with Recurrent Fast Weight ProgrammersCode1
Pretraining Representations for Data-Efficient Reinforcement LearningCode1
Show:102550
← PrevPage 1 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GDI-I3(200M frames)Score864Unverified
2GDI-H3(200M frames)Score864Unverified
3GDI-H3Score864Unverified
4GDI-I3Score864Unverified
5Bootstrapped DQNScore855Unverified
6FQFScore854.2Unverified
7R2D2Score837.7Unverified
8Ape-XScore800.9Unverified
9Agent57Score790.4Unverified
10IMPALA (deep)Score787.34Unverified
#ModelMetricClaimedVerifiedStatus
1IQNScore34Unverified
2QR-DQN-1Score34Unverified
3TRPO-hashScore34Unverified
4NoisyNet-DuelingScore34Unverified
5GDI-H3Score34Unverified
6GDI-H3(200M frames)Score34Unverified
7GDI-I3Score34Unverified
8Go-ExploreScore34Unverified
9Bootstrapped DQNScore33.9Unverified
10C51 noopScore33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Agent57Score580,328.14Unverified
2QR-DQN-1Score572,510Unverified
3R2D2Score408,850Unverified
4IMPALA (deep)Score351,200.12Unverified
5Ape-XScore302,391.3Unverified
6A2C + SILScore104,975.6Unverified
7MuZero (Res2 Adam)Score94,906.25Unverified
8DreamerV2Score94,688Unverified
9MuZeroScore72,276Unverified
10DNAScore52,398Unverified
#ModelMetricClaimedVerifiedStatus
1GDI-H3Score1,000,000Unverified
2GDI-H3(200M frames)Score1,000,000Unverified
3Agent57Score999,997.63Unverified
4R2D2Score999,996.7Unverified
5MuZeroScore999,976.52Unverified
6MuZero (Res2 Adam)Score999,659.18Unverified
7GDI-I3Score943,910Unverified
8Ape-XScore392,952.3Unverified
9C51 noopScore266,434Unverified
10Duel noopScore50,254.2Unverified