| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 | 5 |
| Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies | Feb 1, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 | 5 |
| Deep Recurrent Q-Learning for Partially Observable MDPs | Jul 23, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Faster Deep Reinforcement Learning with Slower Online Network | Dec 10, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Agent for Scheduling in HPC | Feb 11, 2021 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 | 5 |
| An Application of Deep Reinforcement Learning to Algorithmic Trading | Apr 7, 2020 | Algorithmic TradingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Differentiable Trust Region Layers for Deep Reinforcement Learning | Jan 22, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Adversarially Guided Actor-Critic | Feb 8, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 | 5 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Benchmarking Batch Deep Reinforcement Learning Algorithms | Oct 3, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers | Nov 22, 2024 | AvgDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| BOHB: Robust and Efficient Hyperparameter Optimization at Scale | Jul 4, 2018 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Distributed Two-tier DRL Framework for Cell-Free Network: Association, Beamforming and Power Allocation | Mar 22, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Blockchain Framework for Artificial Intelligence Computation | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Bridging RL Theory and Practice with the Effective Horizon | Apr 19, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |