| DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization | Sep 25, 2023 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Job Shop Scheduling via Deep Reinforcement Learning: a Sequence to Sequence approach | Aug 3, 2023 | Combinatorial Optimization, Decoder | Code Available | 1 | 5 |
| Joint Deep Reinforcement Learning and Unfolding: Beam Selection and Precoding for mmWave Multiuser MIMO with Lens Arrays | Jan 5, 2021 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| AllenAct: A Framework for Embodied AI Research | Aug 28, 2020 | Deep Reinforcement Learning, Embodied Question Answering | Code Available | 1 | 5 |
| Deep Deterministic Portfolio Optimization | Mar 13, 2020 | Deep Reinforcement Learning, Portfolio Optimization | Code Available | 1 | 5 |
| DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills | Apr 8, 2018 | Deep Reinforcement Learning, Motion Synthesis | Code Available | 1 | 5 |
| Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration | Dec 1, 2020 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep Intrinsically Motivated Exploration in Continuous Control | Oct 1, 2022 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Deep Lagrangian Networks for end-to-end learning of energy-based control for under-actuated systems | Jul 10, 2019 | Deep Learning, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement Learning, Management | Code Available | 1 | 5 |
| DeepMind Lab2D | Nov 13, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari Games, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Learning Collaborative Policies to Solve NP-hard Routing Problems | Oct 26, 2021 | Deep Reinforcement Learning, Traveling Salesman Problem | Code Available | 1 | 5 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 | 5 |
| Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers | Nov 22, 2024 | Avg, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Faster Deep Reinforcement Learning with Slower Online Network | Dec 10, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| An Introduction to Deep Reinforcement Learning | Nov 30, 2018 | BIG-bench Machine Learning, Decision Making | Code Available | 1 | 5 |
| A Comprehensive Survey on Self-Interpretable Neural Networks | Jan 26, 2025 | Deep Reinforcement Learning, Survey | Code Available | 1 | 5 |
| Deep Recurrent Q-Learning for Partially Observable MDPs | Jul 23, 2015 | Atari Games, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep Reinforcement Agent for Scheduling in HPC | Feb 11, 2021 | Deep Reinforcement Learning, Scheduling | Code Available | 1 | 5 |
| Tactical Optimism and Pessimism for Deep Reinforcement Learning | Feb 7, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Deep-Reinforcement-Learning-Based AoI-Aware Resource Allocation for RIS-Aided IoV Networks | Jun 17, 2024 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| DISCOVER: Deep identification of symbolically concise open-form PDEs via enhanced reinforcement-learning | Oct 4, 2022 | Deep Reinforcement Learning, Form | Code Available | 1 | 5 |
| An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Nov 26, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |