| Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks | Nov 20, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving Environment | Dec 22, 2021 | Autonomous DrivingBenchmarking | CodeCode Available | 0 | 5 |
| Dynamic Network Reconfiguration for Entropy Maximization using Deep Reinforcement Learning | May 26, 2022 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 | 5 |
| Contrastive Representation for Interactive Recommendation | Dec 24, 2024 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Aug 23, 2024 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 | 5 |
| Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions | Oct 11, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Explainable Post hoc Portfolio Management Financial Policy of a Deep Reinforcement Learning agent | Jul 19, 2024 | Deep Reinforcement LearningFeature Importance | CodeCode Available | 0 | 5 |
| AdsorbRL: Deep Multi-Objective Reinforcement Learning for Inverse Catalysts Design | Dec 4, 2023 | Deep Reinforcement LearningMulti-Objective Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dueling Network Architectures for Deep Reinforcement Learning | Nov 20, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Dynamic Control of a Fiber Manufacturing Process using Deep Reinforcement Learning | Nov 23, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market | Dec 24, 2024 | BenchmarkingDecision Making | CodeCode Available | 0 | 5 |
| Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp | Nov 30, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| A DRL solution to help reduce the cost in waiting time of securing a traffic light for cyclists | Nov 23, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DRLViz: Understanding Decisions and Memory in Deep Reinforcement Learning | Sep 6, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Jun 14, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DRL-Based Resource Allocation for Motion Blur Resistant Federated Self-Supervised Learning in IoV | Aug 17, 2024 | CPUDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation | May 25, 2023 | counterfactualCounterfactual Explanation | CodeCode Available | 0 | 5 |
| Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning | Jan 29, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DRL-Based Medium-Term Planning of Renewable-Integrated Self-Scheduling Cascaded Hydropower to Guide Wholesale Market Participation | Jan 8, 2025 | Deep Reinforcement LearningScheduling | CodeCode Available | 0 | 5 |
| Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations | Apr 12, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| Dual Policy Distillation | Jun 7, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Fast deep reinforcement learning using online adjustments from the past | Oct 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach | Jan 1, 2021 | Deep Reinforcement LearningFault Detection | CodeCode Available | 0 | 5 |
| Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms | Sep 1, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |