| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active LearningDecision Making | —Unverified | 0 | 0 |
| DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | Oct 7, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Apr 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Federated Learning with Uncertainty via Distilled Predictive Distributions | Jun 15, 2022 | Active LearningDecision Making | —Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Bayesian Exploration Networks | Aug 24, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision MakingDistributed Optimization | —Unverified | 0 | 0 |
| Bayesian decision-making under misspecified priors with applications to meta-learning | Jul 3, 2021 | Decision MakingMeta-Learning | —Unverified | 0 | 0 |
| A naive aggregation algorithm for improving generalization in a class of learning problems | Sep 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Sampling for Discovery | May 30, 2022 | Decision MakingDrug Discovery | —Unverified | 0 | 0 |
| Distributed Online Learning in Social Recommender Systems | Sep 26, 2013 | Decision MakingRecommendation Systems | —Unverified | 0 | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision MakingEdge-computing | —Unverified | 0 | 0 |
| Learning "What-if" Explanations for Sequential Decision-Making | Jul 2, 2020 | counterfactualCounterfactual Reasoning | —Unverified | 0 | 0 |
| Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | Apr 13, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Batched Nonparametric Bandits via k-Nearest Neighbor UCB | May 15, 2025 | Decision MakingMarketing | —Unverified | 0 | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | ClusteringDecision Making | —Unverified | 0 | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |