| Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Online Planning Algorithms for POMDPs | Jan 15, 2014 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Online Planning for Decentralized Stochastic Control with Partial History Sharing | Aug 6, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints | Dec 16, 2023 | Decision Making, Fairness | —Unverified | 0 |
| Online Sequential Decision-Making with Unknown Delays | Feb 12, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | Dec 30, 2022 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Online Statistical Inference in Decision-Making with Matrix Context | Dec 21, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| On Optimal Robustness to Adversarial Corruption in Online Decision Problems | Sep 22, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Towards Tractable Optimism in Model-Based Reinforcement Learning | Jun 21, 2020 | continuous-control, Continuous Control | —Unverified | 0 |
| On preserving non-discrimination when combining expert advice | Oct 28, 2018 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision Making, Diversity | —Unverified | 0 |
| On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models | May 22, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| On the Expressivity of Multidimensional Markov Reward | Jul 22, 2023 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision Making, Policy Gradient Methods | —Unverified | 0 |
| On the Modeling Capabilities of Large Language Models for Sequential Decision Making | Oct 8, 2024 | Decision Making, Diversity | —Unverified | 0 |
| On the Performance of Empirical Risk Minimization with Smoothed Data | Feb 22, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes | Jun 12, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Mar 1, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Open Problem: Approximate Planning of POMDPs in the class of Memoryless Policies | Aug 17, 2016 | Decision Making, Reinforcement Learning | —Unverified | 0 |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | May 27, 2024 | Decision Making, Offline RL | —Unverified | 0 |
| Robust optimal policies for team Markov games | May 16, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes | Sep 9, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks | Sep 13, 2017 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |
| Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains | Mar 15, 2016 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach | Oct 7, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 |