| Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC | Nov 6, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 1 |
| Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization | Jun 20, 2023 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Neural Ordinary Differential Equation Control of Dynamics on Graphs | Jun 17, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Object Detection with Deep Reinforcement Learning | Aug 9, 2022 | Active Object LocalizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Goal Misgeneralization in Deep Reinforcement Learning | May 28, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| BOHB: Robust and Efficient Hyperparameter Optimization at Scale | Jul 4, 2018 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Off-Policy Deep Reinforcement Learning without Exploration | Dec 7, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Fast-Converged Deep Reinforcement Learning for Optimal Dispatch of Large-Scale Power Systems under Transient Security Constraints | Apr 17, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Online 3D Bin Packing with Constrained Deep Reinforcement Learning | Jun 26, 2020 | 3D Bin PackingCollision Avoidance | CodeCode Available | 1 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | BenchmarkingCPU | CodeCode Available | 1 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving | Nov 5, 2019 | Automated Theorem ProvingDeep Reinforcement Learning | CodeCode Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Benchmarking Batch Deep Reinforcement Learning Algorithms | Oct 3, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning | Oct 23, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 |
| Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs | Jul 24, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Action Space Shaping in Deep Reinforcement Learning | Apr 2, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |