| Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey | Sep 28, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 | 0 |
| Deep Robust Kalman Filter | Mar 7, 2017 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework | Aug 3, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 | 0 |
| Delay and Cooperation in Nonstochastic Linear Bandits | Dec 1, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Delays in Reinforcement Learning | Sep 20, 2023 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Jan 1, 2025 | Clustering, Online Clustering | Unverified | 0 | 0 |
| Demystify Painting with RL | Dec 14, 2020 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Design of intentional backdoors in sequential models | Feb 26, 2019 | Decision Making, Reinforcement Learning | Unverified | 0 | 0 |
| Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Jul 25, 2024 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Apr 23, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning | Sep 23, 2021 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 | 0 |
| DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | Jul 22, 2023 | Decision Making, Minecraft | Unverified | 0 | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | Clustering, Decision Making | Unverified | 0 | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 | 0 |
| Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | Apr 13, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision Making, Edge-computing | Unverified | 0 | 0 |
| Distributed Online Learning in Social Recommender Systems | Sep 26, 2013 | Decision Making, Recommendation Systems | Unverified | 0 | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision Making, Distributed Optimization | Unverified | 0 | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Apr 13, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous Driving, Decision Making | Unverified | 0 | 0 |
| DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | Oct 7, 2024 | Multi-Armed Bandits, Sequential Decision Making | Unverified | 0 | 0 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision Making, Multi-Armed Bandits | Unverified | 0 | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | Dec 19, 2024 | Autonomous Driving, Decision Making | Unverified | 0 | 0 |
| Dynamic Bi-Objective Routing of Multiple Vehicles | May 28, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Dynamic Decision Making for Graphical Models Applied to Oil Exploration | Jan 20, 2012 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning | Aug 3, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 | 0 |
| EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | Oct 31, 2024 | Bayesian Optimization, Decision Making | Unverified | 0 | 0 |
| Effective Dimension in Bandit Problems under Censorship | Feb 14, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | Dec 10, 2024 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 | 0 |
| Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability | Nov 24, 2021 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Efficient quantum recurrent reinforcement learning via quantum reservoir computing | Sep 13, 2023 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Efficient Reinforcement Learning with Large Language Model Priors | Oct 10, 2024 | Bayesian Inference, Decision Making | Unverified | 0 | 0 |
| Efficient Sequential Decision Making with Large Language Models | Jun 17, 2024 | Decision Making, Model Selection | Unverified | 0 | 0 |
| Efficient Strategy Synthesis for MDPs via Hierarchical Block Decomposition | Jun 21, 2025 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Embodied Scene Understanding for Vision Language Models via MetaVQA | Jan 15, 2025 | Decision Making, Question Answering | Unverified | 0 | 0 |
| Emergent Risk Awareness in Rational Agents under Resource Constraints | May 29, 2025 | Sequential Decision Making | Unverified | 0 | 0 |
| Enhancing Q-Learning with Large Language Model Heuristics | May 6, 2024 | Decision Making, Language Modeling | Unverified | 0 | 0 |
| Entropy Regularization for Population Estimation | Aug 24, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches | Jun 26, 2022 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing | May 17, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 | 0 |
| Evaluating Explanation Methods for Vision-and-Language Navigation | Oct 10, 2023 | Decision Making, Navigate | Unverified | 0 | 0 |
| Experimental analysis of data-driven control for a building heating system | Jul 13, 2015 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring | Apr 2, 2023 | Decision Making, reinforcement-learning | Unverified | 0 | 0 |
| Explainable Reinforcement Learning Agents Using World Models | May 12, 2025 | counterfactual, reinforcement-learning | Unverified | 0 | 0 |