| Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics | May 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Influence-aware Memory Architectures for Deep Reinforcement Learning | Nov 18, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Context-Aware Visual Policy Network for Sequence-Level Image Captioning | Aug 16, 2018 | Deep Reinforcement LearningImage Captioning | CodeCode Available | 0 | 5 |
| Information-Directed Exploration for Deep Reinforcement Learning | Dec 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dynamic Control of a Fiber Manufacturing Process using Deep Reinforcement Learning | Nov 23, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Aug 23, 2024 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 | 5 |
| Dynamic Measurement Scheduling for Event Forecasting using Deep RL | Jan 24, 2019 | Deep Reinforcement LearningICU Mortality | CodeCode Available | 0 | 5 |
| Instance based Generalization in Reinforcement Learning | Nov 2, 2020 | Deep Reinforcement LearningGeneralization Bounds | CodeCode Available | 0 | 5 |
| Dual Policy Distillation | Jun 7, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Jun 14, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Dueling Network Architectures for Deep Reinforcement Learning | Nov 20, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dynamic Network Reconfiguration for Entropy Maximization using Deep Reinforcement Learning | May 26, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 | 5 |
| DRL-Based Resource Allocation for Motion Blur Resistant Federated Self-Supervised Learning in IoV | Aug 17, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Decision Theory-Guided Deep Reinforcement Learning for Fast Learning | Feb 8, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| DRL-Based Medium-Term Planning of Renewable-Integrated Self-Scheduling Cascaded Hydropower to Guide Wholesale Market Participation | Jan 8, 2025 | Deep Reinforcement LearningScheduling | CodeCode Available | 0 | 5 |
| Intent-Aware DRL-Based NOMA Uplink Dynamic Scheduler for IIoT | Mar 27, 2024 | Deep Reinforcement LearningScheduling | CodeCode Available | 0 | 5 |
| Application of linear regression method to the deep reinforcement learning in continuous action cases | Mar 19, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Driving in Dense Traffic with Model-Free Reinforcement Learning | Sep 15, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DRiLLS: Deep Reinforcement Learning for Logic Synthesis | Nov 11, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning | Sep 6, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market | Dec 24, 2024 | BenchmarkingDecision Making | CodeCode Available | 0 | 5 |
| Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning | Sep 24, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dota 2 with Large Scale Deep Reinforcement Learning | Dec 13, 2019 | Deep Reinforcement LearningDota 2 | CodeCode Available | 0 | 5 |
| DouRN: Improving DouZero by Residual Neural Networks | Mar 21, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DOM-Q-NET: Grounded RL on Structured Language | Feb 19, 2019 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 | 5 |
| Congested Urban Networks Tend to Be Insensitive to Signal Settings: Implications for Learning-Based Control | Aug 21, 2020 | Deep Reinforcement LearningTraffic Signal Control | CodeCode Available | 0 | 5 |
| Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning | Jun 29, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Conditionally Combining Robot Skills using Large Language Models | Oct 25, 2023 | Deep Reinforcement LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Do deep reinforcement learning agents model intentions? | May 15, 2018 | Deep Reinforcement Learningmodel | CodeCode Available | 0 | 5 |
| Deep Attention Q-Network for Personalized Treatment Recommendation | Jul 4, 2023 | Deep AttentionDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Domes to Drones: Self-Supervised Active Triangulation for 3D Human Pose Reconstruction | Dec 1, 2019 | 2D Pose Estimation3D Pose Estimation | CodeCode Available | 0 | 5 |
| Diversity Optimization for Travelling Salesman Problem via Deep Reinforcement Learning | Jan 1, 2025 | DecoderDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Jointly Pre-training with Supervised, Autoencoder, and Value Losses for Deep Reinforcement Learning | Apr 3, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Divide-and-Conquer Reinforcement Learning | Nov 27, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Diversity-based Deep Reinforcement Learning Towards Multidimensional Difficulty for Fighting Game AI | Nov 4, 2022 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 | 5 |
| Combining imagination and heuristics to learn strategies that generalize | Sep 10, 2018 | Deep Reinforcement LearningHierarchical Reinforcement Learning | CodeCode Available | 0 | 5 |
| ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments | May 31, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| L2SR: Learning to Sample and Reconstruct for Accelerated MRI via Reinforcement Learning | Dec 5, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Divide & Conquer Imitation Learning | Apr 15, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| Distributional Bellman Operators over Mean Embeddings | Dec 9, 2023 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| A Partially Supervised Reinforcement Learning Framework for Visual Active Search | Oct 15, 2023 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 0 | 5 |
| Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet | Dec 15, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Jan 31, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Energy-Efficient Thermal Comfort Control in Smart Buildings via Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model | May 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Discovering Object-Centric Generalized Value Functions From Pixels | Apr 27, 2023 | Deep Reinforcement LearningObject | CodeCode Available | 0 | 5 |
| Discovering Diverse Solutions in Deep Reinforcement Learning by Maximizing State-Action-Based Mutual Information | Mar 12, 2021 | Continuous ControlDeep Reinforcement Learning | CodeCode Available | 0 | 5 |