| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | Continuous Control | —Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | Continuous Control | —Unverified | 0 | 0 |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Apr 13, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous Driving, Decision Making | —Unverified | 0 | 0 |
| DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | Oct 7, 2024 | Multi-Armed Bandits, Sequential Decision Making | —Unverified | 0 | 0 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | Dec 19, 2024 | Autonomous Driving, Decision Making | —Unverified | 0 | 0 |
| Dynamic Bi-Objective Routing of Multiple Vehicles | May 28, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |