| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | Jul 29, 2022 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | Jul 26, 2022 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Partial-Monotone Adaptive Submodular Maximization | Jul 26, 2022 | Active Learning, Decision Making | —Unverified | 0 |
| Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations | Jul 25, 2022 | Decision Making, Meta-Learning | —Unverified | 0 |
| Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution | Jul 22, 2022 | Algorithmic Trading, continuous-control | —Unverified | 0 |
| High dimensional stochastic linear contextual bandit with missing covariates | Jul 22, 2022 | Decision Making, Experimental Design | —Unverified | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | Anatomy, Decision Making | —Unverified | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision Making, Multi-Armed Bandits | —Unverified | 0 |