| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Jul 1, 2023 | Computational Efficiency, Decision Making | Unverified | 0 |
| Thompson sampling for improved exploration in GFlowNets | Jun 30, 2023 | Active Learning, Decision Making | Unverified | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | Jun 26, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A General Framework for Sequential Decision-Making under Adaptivity Constraints | Jun 26, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Large Sequence Models for Sequential Decision-Making: A Survey | Jun 24, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent | Jun 20, 2023 | Bayesian Optimization, Decision Making | Code Available | 1 |
| You Can Trade Your Experience in Distributed Multi-Agent Multi-Armed Bandits | Jun 19, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| IF2Net: Innately Forgetting-Free Networks for Continual Learning | Jun 18, 2023 | Continual Learning, Decision Making | Unverified | 0 |
| Simplified Temporal Consistency Reinforcement Learning | Jun 15, 2023 | Decision Making, reinforcement-learning | Code Available | 1 |
| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations | Jun 13, 2023 | Decision Making, Disentanglement | Code Available | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Jun 9, 2023 | Decision Making, reinforcement-learning | Code Available | 1 |
| Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel | Jun 9, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Federated Linear Contextual Bandits with User-level Differential Privacy | Jun 8, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active Learning, Decision Making | Code Available | 0 |
| AI-based Identification of Most Critical Cyberattacks in Industrial Systems | Jun 7, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion | Jun 7, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal Inference, Decision Making | Code Available | 0 |
| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Learning Embeddings for Sequential Tasks Using Population of Agents | Jun 5, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Making, model | Unverified | 0 |
| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision Making, Image Generation | Code Available | 1 |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Jun 1, 2023 | Decision Making, Image Generation | Code Available | 2 |