| Design of intentional backdoors in sequential models | Feb 26, 2019 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Bandit Convex Optimization in Non-stationary Environments | Jul 29, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 |
| Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Jul 25, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision MakingFairness | —Unverified | 0 |
| Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Apr 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning | Sep 23, 2021 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | Jul 22, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | ClusteringDecision Making | —Unverified | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Computational Framework for Motor Skill Acquisition | Jan 3, 2019 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | Apr 13, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision MakingEdge-computing | —Unverified | 0 |
| Distributed Online Learning in Social Recommender Systems | Sep 26, 2013 | Decision MakingRecommendation Systems | —Unverified | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision MakingDistributed Optimization | —Unverified | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time | Sep 20, 2021 | Decision Makingregression | —Unverified | 0 |
| Algorithms for CVaR Optimization in MDPs | Jun 12, 2014 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous DrivingDecision Making | —Unverified | 0 |
| A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning | Oct 27, 2021 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |