| Learning Multimodal AI Algorithms for Amplifying Limited User Input into High-dimensional Control Space | May 16, 2025 | Deep Reinforcement LearningIntent Detection | CodeCode Available | 0 | 5 |
| Improving Policy Optimization with Generalist-Specialist Learning | Jun 26, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| Dueling Network Architectures for Deep Reinforcement Learning | Nov 20, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Driving in Dense Traffic with Model-Free Reinforcement Learning | Sep 15, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| DRiLLS: Deep Reinforcement Learning for Logic Synthesis | Nov 11, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| DRL4AOI: A DRL Framework for Semantic-aware AOI Segmentation in Location-Based Services | Dec 6, 2024 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 | 5 |
| A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning | May 29, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| Increasing performance of electric vehicles in ride-hailing services using deep reinforcement learning | Dec 7, 2019 | Autonomous VehiclesDecision Making | CodeCode Available | 0 | 5 |
| Application of linear regression method to the deep reinforcement learning in continuous action cases | Mar 19, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| A Study on Overfitting in Deep Reinforcement Learning | Apr 18, 2018 | Deep Reinforcement LearningInductive Bias | CodeCode Available | 0 | 5 |
| Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation | Jul 18, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| Dota 2 with Large Scale Deep Reinforcement Learning | Dec 13, 2019 | Deep Reinforcement LearningDota 2 | CodeCode Available | 0 | 5 |
| DouRN: Improving DouZero by Residual Neural Networks | Mar 21, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 | 5 |
| DRL-Based Medium-Term Planning of Renewable-Integrated Self-Scheduling Cascaded Hydropower to Guide Wholesale Market Participation | Jan 8, 2025 | Deep Reinforcement LearningScheduling | CodeCode Available | 0 | 5 |
| DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Aug 23, 2024 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 | 5 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Do deep reinforcement learning agents model intentions? | May 15, 2018 | Deep Reinforcement Learningmodel | CodeCode Available | 0 | 5 |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Jan 31, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Divide-and-Conquer Reinforcement Learning | Nov 27, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Divide & Conquer Imitation Learning | Apr 15, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 | 5 |
| Domes to Drones: Self-Supervised Active Triangulation for 3D Human Pose Reconstruction | Dec 1, 2019 | 2D Pose Estimation3D Pose Estimation | CodeCode Available | 0 | 5 |
| Congested Urban Networks Tend to Be Insensitive to Signal Settings: Implications for Learning-Based Control | Aug 21, 2020 | Deep Reinforcement LearningTraffic Signal Control | CodeCode Available | 0 | 5 |
| Conditionally Elicitable Dynamic Risk Measures for Deep Reinforcement Learning | Jun 29, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Conditionally Combining Robot Skills using Large Language Models | Oct 25, 2023 | Deep Reinforcement LearningLanguage Modeling | CodeCode Available | 0 | 5 |