| DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation | Oct 19, 2022 | Deep Reinforcement Learning, Imitation Learning | Code Available | 2 |
| Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Oct 13, 2022 | Atari Games, Decision Making | Code Available | 2 |
| Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI | Oct 10, 2022 | Deep Reinforcement Learning | Code Available | 2 |
| Transformers are Sample-Efficient World Models | Sep 1, 2022 | Atari Games 100k, Deep Reinforcement Learning | Code Available | 2 |
| A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning | Aug 16, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| Deep Reinforcement Learning for Multi-Agent Interaction | Aug 2, 2022 | BIG-bench Machine Learning, Causal Inference | Code Available | 2 |
| CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning | Jul 5, 2022 | Code Generation, Decoder | Code Available | 2 |
| DayDreamer: World Models for Physical Robot Learning | Jun 28, 2022 | Deep Reinforcement Learning, Navigate | Code Available | 2 |
| Accelerated Policy Learning with Parallel Differentiable Simulation | Apr 14, 2022 | Deep Reinforcement Learning | Code Available | 2 |
| VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning | Feb 17, 2022 | Deep Reinforcement Learning, Offline RL | Code Available | 2 |
| Reinforcement Learning Textbook | Jan 19, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance | Dec 13, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 |
| ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning | Dec 11, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 |
| Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Sep 29, 2021 | 3D Bin Packing, Deep Reinforcement Learning | Code Available | 2 |
| Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning | Sep 24, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 |
| Learning Practically Feasible Policies for Online 3D Bin Packing | Aug 31, 2021 | 3D Bin Packing, Collision Avoidance | Code Available | 2 |
| Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Jun 28, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 |
| DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Jun 11, 2021 | Card Games, Deep Reinforcement Learning | Code Available | 2 |
| Model-agnostic and Scalable Counterfactual Explanations via Reinforcement Learning | Jun 4, 2021 | counterfactual, Deep Reinforcement Learning | Code Available | 2 |
| Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Dec 16, 2020 | Combinatorial Optimization, Decision Making | Code Available | 2 |
| Decoupling Representation Learning from Reinforcement Learning | Sep 14, 2020 | Data Augmentation, Deep Reinforcement Learning | Code Available | 2 |
| Flightmare: A Flexible Quadrotor Simulator | Sep 1, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| rlpyt: A Research Code Base for Deep Reinforcement Learning in PyTorch | Sep 3, 2019 | Deep Reinforcement Learning, Q-Learning | Code Available | 2 |
| Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles | Dec 14, 2018 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 2 |
| Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 2 |
| Accelerated Methods for Deep Reinforcement Learning | Mar 7, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 2 |
| Flow: A Modular Learning Framework for Mixed Autonomy Traffic | Oct 16, 2017 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 2 |
| Benchmarking Deep Reinforcement Learning for Continuous Control | Apr 22, 2016 | Action Triplet Recognition, Atari Games | Code Available | 2 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| The Cell Must Go On: Agar.io for Continual Reinforcement Learning | May 23, 2025 | Continual Learning, Deep Reinforcement Learning | Code Available | 1 |
| GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy | May 18, 2025 | Cloud Computing, Deep Reinforcement Learning | Code Available | 1 |
| Reasoning on a Budget: Miniaturizing DeepSeek R1 with SFT-GRPO Alignment for Instruction-Tuned LLMs | May 16, 2025 | Deep Reinforcement Learning, Mathematical Reasoning | Code Available | 1 |
| Evaluating Robustness of Deep Reinforcement Learning for Autonomous Surface Vehicle Control in Field Tests | May 15, 2025 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 |
| Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | May 8, 2025 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson Disease | Apr 26, 2025 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Learning Decision Trees as Amortized Structure Inference | Mar 10, 2025 | Anomaly Detection, Deep Reinforcement Learning | Code Available | 1 |
| Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement Learning | Mar 9, 2025 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | Mar 8, 2025 | Deep Reinforcement Learning, Representation Learning | Code Available | 1 |
| Playing Pokémon Red via Deep Reinforcement Learning | Feb 27, 2025 | Deep Reinforcement Learning, Language Modeling | Code Available | 1 |
| ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments | Feb 27, 2025 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement Error | Feb 23, 2025 | Adversarial Robustness, Deep Reinforcement Learning | Code Available | 1 |
| Reevaluating Policy Gradient Methods for Imperfect-Information Games | Feb 13, 2025 | counterfactual, Deep Reinforcement Learning | Code Available | 1 |
| A Comprehensive Survey on Self-Interpretable Neural Networks | Jan 26, 2025 | Deep Reinforcement Learning, Survey | Code Available | 1 |
| Divergence-Augmented Policy Optimization | Jan 25, 2025 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks | Jan 16, 2025 | Deep Reinforcement Learning, Management | Code Available | 1 |
| CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Jan 14, 2025 | Deep Reinforcement Learning, GPU | Code Available | 1 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater | Jan 1, 2025 | Deep Reinforcement Learning, Segmentation | Code Available | 1 |
| GRAM: Generalization in Deep RL with a Robust Adaptation Module | Dec 5, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |