| Catalyst.RL: A Distributed Framework for Reproducible RL Research | Feb 28, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| CDT: Cascading Decision Trees for Explainable Reinforcement Learning | Nov 15, 2020 | Deep Reinforcement LearningExplainable Models | CodeCode Available | 1 | 5 |
| Training a Resilient Q-Network against Observational Interference | Feb 18, 2021 | Causal InferenceDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Comparing Observation and Action Representations for Deep Reinforcement Learning in μRTS | Oct 26, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series Forecasting | Oct 25, 2024 | Deep Reinforcement LearningTime Series | CodeCode Available | 1 | 5 |
| Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | May 8, 2025 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Enhancing System-Level Safety in Mixed-Autonomy Platoon via Safe Reinforcement Learning | Jan 20, 2024 | Autonomous DrivingCollision Avoidance | CodeCode Available | 1 | 5 |
| A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning | Apr 24, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 | 5 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Nov 26, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Chip Placement with Deep Reinforcement Learning | Apr 22, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Evaluating Robustness of Deep Reinforcement Learning for Autonomous Surface Vehicle Control in Field Tests | May 15, 2025 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Affordance Learning from Play for Sample-Efficient Policy Learning | Mar 1, 2022 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 1 | 5 |
| Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform | Sep 29, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Experience Replay with Likelihood-free Importance Weights | Jun 23, 2020 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 1 | 5 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Co-designing Intelligent Control of Building HVACs and Microgrids | Jul 18, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for List-wise Recommendations | Dec 30, 2017 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 | 5 |
| Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach | Dec 1, 2023 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 | 5 |
| Exploration by Random Network Distillation | Oct 30, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Persevering EdgeIoT | Feb 15, 2022 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 | 5 |
| Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Oct 4, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach | Apr 26, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Joint Spectrum and Power Allocation in Cellular Networks | Dec 19, 2020 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental Platforms | Dec 17, 2021 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 | 5 |
| An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control | Jan 11, 2024 | Deep Reinforcement LearningIncremental Learning | CodeCode Available | 1 | 5 |
| Finding Failures in High-Fidelity Simulation using Adaptive Stress Testing and the Backward Algorithm | Jul 27, 2021 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Deep reinforcement learning for large-scale epidemic control | Mar 30, 2020 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Combining Deep Reinforcement Learning and Search for Imperfect-Information Games | Jul 27, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Market Making Under a Hawkes Process-Based Limit Order Book Model | Jul 20, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D Ultrasound | Mar 26, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Dec 10, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning | Jan 30, 2024 | Causal DiscoveryDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Computational Performance of Deep Reinforcement Learning to find Nash Equilibria | Apr 26, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers | Jun 20, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement Learning | Mar 3, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation | Jul 6, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint Generators | Apr 8, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 1 | 5 |
| Continuous Coordination As a Realistic Scenario for Lifelong Learning | Mar 4, 2021 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| 2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following | Jul 25, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy | May 18, 2025 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement LearningDQN Replay Dataset | CodeCode Available | 1 | 5 |
| Generalizing Across Multi-Objective Reward Functions in Deep Reinforcement Learning | Sep 17, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detectioncontinuous-control | CodeCode Available | 1 | 5 |
| Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling | Nov 9, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Human-Like Driving Policies in Collision Avoidance Tasks of Self-Driving Cars | Jun 7, 2020 | Autonomous VehiclesCollision Avoidance | CodeCode Available | 1 | 5 |