| A Scalable and Reproducible System-on-Chip Simulation for Reinforcement Learning | Apr 27, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Producing Furniture Layout in Indoor Scenes | Jan 19, 2021 | Deep Reinforcement LearningPosition | CodeCode Available | 1 | 5 |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Dec 10, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning based Group Recommender System | Jun 13, 2021 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Time Allocation and Directional Transmission in Joint Radar-Communication | May 19, 2022 | Autonomous VehiclesDecision Making Under Uncertainty | CodeCode Available | 1 | 5 |
| Asset Allocation: From Markowitz to Deep Reinforcement Learning | Jul 14, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Asynchronous Methods for Deep Reinforcement Learning | Feb 4, 2016 | Atari GamesCPU | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for URLLC data management on top of scheduled eMBB traffic | Mar 2, 2021 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative FilteringDecision Making | CodeCode Available | 1 | 5 |
| A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation | Apr 14, 2020 | Deep Reinforcement LearningInteractive Recommendation | CodeCode Available | 1 | 5 |
| A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games | Jun 12, 2022 | Deep Reinforcement LearningMuJoCo Games | CodeCode Available | 1 | 5 |
| Automatic Data Augmentation for Generalization in Deep Reinforcement Learning | Jun 23, 2020 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| SoundSpaces: Audio-Visual Navigation in 3D Environments | Dec 24, 2019 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 | 5 |
| Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning at the Edge of the Statistical Precipice | Aug 30, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep-Reinforcement-Learning-Based AoI-Aware Resource Allocation for RIS-Aided IoV Networks | Jun 17, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork | Jun 19, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions | Oct 19, 2023 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning Control of Quantum Cartpoles | Oct 21, 2019 | Deep LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 | 5 |
| Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving Policies | Dec 22, 2021 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player | Feb 21, 2021 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Adversarial Deep Reinforcement Learning in Portfolio Management | Aug 29, 2018 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 | 5 |
| Faster Deep Reinforcement Learning with Slower Online Network | Dec 10, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers | Nov 22, 2024 | AvgDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds | Oct 24, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 | 5 |
| Developing an OpenAI Gym-compatible framework and simulation environment for testing Deep Reinforcement Learning agents solving the Ambulance Location Problem | Jan 12, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Deep Recurrent Q-Learning for Partially Observable MDPs | Jul 23, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Application of Deep Reinforcement Learning to Algorithmic Trading | Apr 7, 2020 | Algorithmic TradingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Differentiable Trust Region Layers for Deep Reinforcement Learning | Jan 22, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Adversarially Guided Actor-Critic | Feb 8, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 1 | 5 |
| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 | 5 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| DeepMind Lab2D | Nov 13, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Benchmarking Batch Deep Reinforcement Learning Algorithms | Oct 3, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Bridging RL Theory and Practice with the Effective Horizon | Apr 19, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC | Nov 6, 2024 | Computational EfficiencyDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Blockchain Framework for Artificial Intelligence Computation | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Agent for Scheduling in HPC | Feb 11, 2021 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |