| AllenAct: A Framework for Embodied AI Research | Aug 28, 2020 | Deep Reinforcement LearningEmbodied Question Answering | CodeCode Available | 1 |
| Tactical Optimism and Pessimism for Deep Reinforcement Learning | Feb 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Deep Reinforcement Trading with Predictable Returns | Apr 29, 2021 | ClusteringDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep reinforcement learning-designed radiofrequency waveform in MRI | May 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning | Apr 24, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | BenchmarkingCPU | CodeCode Available | 1 |
| Learning Dexterous Grasping with Object-Centric Visual Affordances | Sep 3, 2020 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Affordance Learning from Play for Sample-Efficient Policy Learning | Mar 1, 2022 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 1 |
| Differentiable Trust Region Layers for Deep Reinforcement Learning | Jan 22, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 |
| DISCOVER: Deep identification of symbolically concise open-form PDEs via enhanced reinforcement-learning | Oct 4, 2022 | Deep Reinforcement LearningForm | CodeCode Available | 1 |
| Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds | Oct 24, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication | Oct 11, 2020 | Deep Reinforcement LearningDistributed Optimization | CodeCode Available | 1 |
| Divergence-Augmented Policy Optimization | Jan 25, 2025 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach | Dec 1, 2023 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 |
| DPO Meets PPO: Reinforced Token Optimization for RLHF | Apr 29, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Drafting in Collectible Card Games via Reinforcement Learning | Nov 7, 2020 | Card GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving | Aug 30, 2023 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 |
| DRL-M4MR: An Intelligent Multicast Routing Approach Based on DQN Deep Reinforcement Learning in SDN | Jul 31, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioning | Sep 9, 2021 | Deep Reinforcement Learning | CodeCode Available | 1 |
| A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification | Jul 15, 2021 | Conformal PredictionDeep Reinforcement Learning | CodeCode Available | 1 |
| Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing | Jul 5, 2023 | Deep Reinforcement LearningSelf-Learning | CodeCode Available | 1 |
| Dynamic Sparse Training for Deep Reinforcement Learning | Jun 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D Ultrasound | Mar 26, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 |
| Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Dec 27, 2023 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Automating DBSCAN via Deep Reinforcement Learning | Aug 9, 2022 | ClusteringComputational Efficiency | CodeCode Available | 1 |
| Embodied Synaptic Plasticity with Online Reinforcement learning | Mar 3, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Energy Harvesting Reconfigurable Intelligent Surface for UAV Based on Robust Deep Reinforcement Learning | Feb 23, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Energy Optimization for HVAC Systems in Multi-VAV Open Offices: A Deep Reinforcement Learning Approach | Jun 23, 2023 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series Forecasting | Oct 25, 2024 | Deep Reinforcement LearningTime Series | CodeCode Available | 1 |
| Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | May 8, 2025 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 1 |
| Environment Agnostic Representation for Visual Reinforcement Learning | Jan 1, 2023 | Deep Reinforcement LearningDomain Generalization | CodeCode Available | 1 |
| 2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following | Jul 25, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| ERL-Re^2: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation | Oct 26, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement LearningDQN Replay Dataset | CodeCode Available | 1 |
| Example-guided learning of stochastic human driving policies using deep reinforcement learning | Dec 23, 2022 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| Experience Replay with Likelihood-free Importance Weights | Jun 23, 2020 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 1 |
| Explainable Reinforcement Learning for Longitudinal Control | Feb 6, 2021 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 1 |
| #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning | Nov 15, 2016 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs | Jul 24, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |