| Best Arm Identification for Stochastic Rising Bandits | Feb 15, 2023 | Decision MakingModel Selection | CodeCode Available | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Effective Dimension in Bandit Problems under Censorship | Feb 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks | Feb 14, 2023 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | Feb 12, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Survey on Causal Reinforcement Learning | Feb 10, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | Feb 9, 2023 | Decision MakingRepresentation Learning | —Unverified | 0 |
| A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis | Feb 8, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications | Feb 7, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Strong Baseline for Batch Imitation Learning | Feb 6, 2023 | continuous-controlContinuous Control | —Unverified | 0 |