| Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms | Sep 1, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning | May 2, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Dueling Network Architectures for Deep Reinforcement Learning | Nov 20, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Aug 23, 2024 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 | 5 |
| DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Jun 14, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning | Sep 6, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Dual Policy Distillation | Jun 7, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning | Oct 11, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization | Jul 29, 2022 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Dynamic Control of a Fiber Manufacturing Process using Deep Reinforcement Learning | Nov 23, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| DRL-Based Medium-Term Planning of Renewable-Integrated Self-Scheduling Cascaded Hydropower to Guide Wholesale Market Participation | Jan 8, 2025 | Deep Reinforcement LearningScheduling | CodeCode Available | 0 | 5 |
| DRL-Based Resource Allocation for Motion Blur Resistant Federated Self-Supervised Learning in IoV | Aug 17, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer Learning | Mar 24, 2025 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Continual Deep Reinforcement Learning with Task-Agnostic Policy Distillation | Nov 25, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning | Nov 16, 2023 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Driving in Dense Traffic with Model-Free Reinforcement Learning | Sep 15, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| AirPilot: Interpretable PPO-based DRL Auto-Tuned Nonlinear PID Drone Controller for Robust Autonomous Flights | Mar 30, 2024 | Deep Reinforcement LearningDrone Controller | CodeCode Available | 0 | 5 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning | Jul 11, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DARLA: Improving Zero-Shot Transfer in Reinforcement Learning | Jul 26, 2017 | Deep Reinforcement LearningDomain Adaptation | CodeCode Available | 0 | 5 |
| Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning | Jan 1, 2024 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Applying Deep Reinforcement Learning to the HP Model for Protein Structure Prediction | Nov 27, 2022 | Deep Reinforcement LearningProtein Folding | CodeCode Available | 0 | 5 |
| Abstract Demonstrations and Adaptive Exploration for Efficient and Stable Multi-step Sparse Reward Reinforcement Learning | Jul 19, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Efficient Information Diffusion in Time-Varying Graphs through Deep Reinforcement Learning | Nov 27, 2020 | Deep Reinforcement LearningGraph Embedding | CodeCode Available | 0 | 5 |