| Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Online Planning Algorithms for POMDPs | Jan 15, 2014 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Online Planning for Decentralized Stochastic Control with Partial History Sharing | Aug 6, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints | Dec 16, 2023 | Decision MakingFairness | —Unverified | 0 |
| Online Sequential Decision-Making with Unknown Delays | Feb 12, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | Dec 30, 2022 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Online Statistical Inference in Decision-Making with Matrix Context | Dec 21, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On Optimal Robustness to Adversarial Corruption in Online Decision Problems | Sep 22, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Towards Tractable Optimism in Model-Based Reinforcement Learning | Jun 21, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| On preserving non-discrimination when combining expert advice | Oct 28, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |