| Quantile Off-Policy Evaluation via Deep Conditional Generative Learning | Dec 29, 2022 | Decision Making, Off-policy evaluation | Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision Making, Offline RL | Unverified | 0 |
| Linear Combinatorial Semi-Bandit with Causally Related Rewards | Dec 25, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Online Statistical Inference in Decision-Making with Matrix Context | Dec 21, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Invariant Lipschitz Bandits: A Side Observation Approach | Dec 14, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Autoregressive Bandits | Dec 12, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Reinforced Approximate Exploratory Data Analysis | Dec 12, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Information-Theoretic Safe Exploration with Gaussian Processes | Dec 9, 2022 | Decision Making, Gaussian Processes | Code Available | 0 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | Dec 8, 2022 | Behavioural cloning, Benchmarking | Unverified | 0 |
| Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance | Dec 4, 2022 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |