| Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | Apr 24, 2019 | Decision Makingfeature selection | —Unverified | 0 | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | Dec 10, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | Jun 4, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | Jun 6, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Effective Dimension in Bandit Problems under Censorship | Feb 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | Feb 22, 2024 | Autonomous RacingDecision Making | —Unverified | 0 | 0 |
| EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | Oct 31, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision MakingRecommendation Systems | —Unverified | 0 | 0 |
| Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning | Aug 3, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Dynamic Decision Making for Graphical Models Applied to Oil Exploration | Jan 20, 2012 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision MakingMotion Planning | —Unverified | 0 | 0 |
| Adaptive Robust Online Portfolio Selection | Jun 2, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Dynamic Bi-Objective Routing of Multiple Vehicles | May 28, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Be Considerate: Objectives, Side Effects, and Deciding How to Act | Jun 4, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Bayesian optimization explains human active search | Dec 1, 2013 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| An Analysis of Frame-skipping in Reinforcement Learning | Feb 7, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| DriveGPT: Scaling Autoregressive Behavior Models for Driving | Dec 19, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Bayesian learning of the optimal action-value function in a Markov decision process | May 3, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Bayesian Inverse Transition Learning for Offline Settings | Aug 9, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |