| Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization | Jan 5, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Local Differential Privacy for Sequential Decision Making in a Changing Environment | Jan 2, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | Dec 30, 2022 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Risk-Sensitive Policy with Distributional Reinforcement Learning | Dec 30, 2022 | Decision Making, Distributional Reinforcement Learning | Code Available | 1 |
| Quantile Off-Policy Evaluation via Deep Conditional Generative Learning | Dec 29, 2022 | Decision Making, Off-policy evaluation | Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision Making, Offline RL | Unverified | 0 |
| Linear Combinatorial Semi-Bandit with Causally Related Rewards | Dec 25, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Online Statistical Inference in Decision-Making with Matrix Context | Dec 21, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systems | Dec 15, 2022 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Invariant Lipschitz Bandits: A Side Observation Approach | Dec 14, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |