| Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | Apr 24, 2019 | Decision Makingfeature selection | —Unverified | 0 | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | Dec 10, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | Jun 4, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | Jun 6, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Effective Dimension in Bandit Problems under Censorship | Feb 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Feb 22, 2024 | Autonomous RacingDecision Making | —Unverified | 0 | 0 |
| EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | Oct 31, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision MakingRecommendation Systems | —Unverified | 0 | 0 |
| Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning | Aug 3, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Dynamic Decision Making for Graphical Models Applied to Oil Exploration | Jan 20, 2012 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision MakingMotion Planning | —Unverified | 0 | 0 |
| Adaptive Robust Online Portfolio Selection | Jun 2, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Dynamic Bi-Objective Routing of Multiple Vehicles | May 28, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Be Considerate: Objectives, Side Effects, and Deciding How to Act | Jun 4, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Bayesian optimization explains human active search | Dec 1, 2013 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | Dec 19, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Bayesian learning of the optimal action-value function in a Markov decision process | May 3, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Bayesian Inverse Transition Learning for Offline Settings | Aug 9, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active LearningDecision Making | —Unverified | 0 | 0 |
| DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | Oct 7, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Bayesian Graph Traversal | Mar 7, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Apr 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Federated Learning with Uncertainty via Distilled Predictive Distributions | Jun 15, 2022 | Active LearningDecision Making | —Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Bayesian Exploration Networks | Aug 24, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision MakingDistributed Optimization | —Unverified | 0 | 0 |
| Bayesian decision-making under misspecified priors with applications to meta-learning | Jul 3, 2021 | Decision MakingMeta-Learning | —Unverified | 0 | 0 |
| A naive aggregation algorithm for improving generalization in a class of learning problems | Sep 6, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Sampling for Discovery | May 30, 2022 | Decision MakingDrug Discovery | —Unverified | 0 | 0 |
| Distributed Online Learning in Social Recommender Systems | Sep 26, 2013 | Decision MakingRecommendation Systems | —Unverified | 0 | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision MakingEdge-computing | —Unverified | 0 | 0 |
| Learning "What-if" Explanations for Sequential Decision-Making | Jul 2, 2020 | counterfactualCounterfactual Reasoning | —Unverified | 0 | 0 |
| Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | Apr 13, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Batched Nonparametric Bandits via k-Nearest Neighbor UCB | May 15, 2025 | Decision MakingMarketing | —Unverified | 0 | 0 |
| An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | Nov 12, 2023 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | ClusteringDecision Making | —Unverified | 0 | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |