| Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Oct 13, 2022 | Atari GamesDecision Making | CodeCode Available | 2 | 5 |
| Accelerated Policy Learning with Parallel Differentiable Simulation | Apr 14, 2022 | Deep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Flow: A Modular Learning Framework for Mixed Autonomy Traffic | Oct 16, 2017 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning | Nov 24, 2022 | Deep Reinforcement LearningLayout Design | CodeCode Available | 2 | 5 |
| Safety-Driven Deep Reinforcement Learning Framework for Cobots: A Sim2Real Approach | Jul 2, 2024 | Deep Reinforcement Learning | CodeCode Available | 2 | 5 |
| ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning | Dec 11, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 2 | 5 |
| Efficient World Models with Context-Aware Tokenization | Jun 27, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance | Dec 13, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 2 | 5 |
| A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning | Aug 16, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 2 | 5 |
| Learning to Solve Job Shop Scheduling under Uncertainty | Mar 4, 2024 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation | Oct 19, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 2 | 5 |
| Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management | Feb 1, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 2 | 5 |
| Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 2 | 5 |
| Decoupling Representation Learning from Reinforcement Learning | Sep 14, 2020 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| DayDreamer: World Models for Physical Robot Learning | Jun 28, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 2 | 5 |
| Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Dec 16, 2020 | Combinatorial OptimizationDecision Making | CodeCode Available | 2 | 5 |
| Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing | Feb 18, 2024 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 2 | 5 |
| CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning | Jul 5, 2022 | Code GenerationDecoder | CodeCode Available | 2 | 5 |
| Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI | Oct 10, 2022 | Deep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Deep Reinforcement Learning for Multi-Agent Interaction | Aug 2, 2022 | BIG-bench Machine LearningCausal Inference | CodeCode Available | 2 | 5 |
| Conformal Symplectic Optimization for Stable Reinforcement Learning | Dec 3, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Assessment of Reinforcement Learning for Macro Placement | Feb 21, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 2 | 5 |
| Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation Network | Jan 18, 2024 | Deep Reinforcement LearningFederated Learning | CodeCode Available | 2 | 5 |
| Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation | May 25, 2024 | Autonomous NavigationDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Jun 11, 2021 | Card GamesDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Flightmare: A Flexible Quadrotor Simulator | Sep 1, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 2 | 5 |
| SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning | Oct 13, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Accelerated Methods for Deep Reinforcement Learning | Mar 7, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 2 | 5 |
| Bridging State and History Representations: Understanding Self-Predictive RL | Jan 17, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| MPC-Inspired Reinforcement Learning for Verifiable Model-Free Control | Dec 8, 2023 | Deep Reinforcement LearningModel Predictive Control | CodeCode Available | 1 | 5 |
| Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning | Oct 23, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 | 5 |
| Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D Ultrasound | Mar 26, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Bridging RL Theory and Practice with the Effective Horizon | Apr 19, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning | Aug 11, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| A Holistic Power Optimization Approach for Microgrid Control Based on Deep Reinforcement Learning | Mar 1, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 | 5 |
| Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach | Dec 1, 2023 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 | 5 |
| BOHB: Robust and Efficient Hyperparameter Optimization at Scale | Jul 4, 2018 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Affordance Learning from Play for Sample-Efficient Policy Learning | Mar 1, 2022 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 1 | 5 |
| A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification | Jul 15, 2021 | Conformal PredictionDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Building a 3-Player Mahjong AI using Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Adversarial Policy Gradient for Deep Learning Image Augmentation | Sep 9, 2019 | ClassificationDeep Learning | CodeCode Available | 1 | 5 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Adversarially Guided Actor-Critic | Feb 8, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC | Nov 6, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |