| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations | Jun 13, 2023 | Decision MakingDisentanglement | CodeCode Available | 0 |
| Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel | Jun 9, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Jun 9, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Federated Linear Contextual Bandits with User-level Differential Privacy | Jun 8, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| AI-based Identification of Most Critical Cyberattacks in Industrial Systems | Jun 7, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion | Jun 7, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal InferenceDecision Making | CodeCode Available | 0 |