| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 | 5 |
| Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach | Apr 26, 2022 | Deep Reinforcement Learning, Management | Code Available | 1 | 5 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement Learning, DQN Replay Dataset | Code Available | 1 | 5 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement Learning, Real-Time Strategy Games | Code Available | 1 | 5 |
| 2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following | Jul 25, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Nov 26, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Combining Deep Reinforcement Learning and Search for Imperfect-Information Games | Jul 27, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detection, continuous-control | Code Available | 1 | 5 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100k, Data Augmentation | Code Available | 1 | 5 |
| Chip Placement with Deep Reinforcement Learning | Apr 22, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform | Sep 29, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| CDT: Cascading Decision Trees for Explainable Reinforcement Learning | Nov 15, 2020 | Deep Reinforcement Learning, Explainable Models | Code Available | 1 | 5 |
| AllenAct: A Framework for Embodied AI Research | Aug 28, 2020 | Deep Reinforcement Learning, Embodied Question Answering | Code Available | 1 | 5 |
| Character Controllers Using Motion VAEs | Mar 26, 2021 | Continuous Control, Deep Reinforcement Learning | Code Available | 1 | 5 |
| CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion | May 7, 2020 | Deep Reinforcement Learning, Motion Synthesis | Code Available | 1 | 5 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Catalyst.RL: A Distributed Framework for Reproducible RL Research | Feb 28, 2019 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving | Feb 17, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Building a 3-Player Mahjong AI using Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Training a Resilient Q-Network against Observational Interference | Feb 18, 2021 | Causal Inference, Deep Reinforcement Learning | Code Available | 1 | 5 |