| Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors | Jul 21, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Online Learning with Costly Features in Non-stationary Environments | Jul 18, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards | Jul 18, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance | Jul 16, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-Player Zero-Sum Markov Games with Networked Separable Interactions | Jul 13, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Jul 13, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| FAIRO: Fairness-aware Adaptation in Sequential-Decision Making for Human-in-the-Loop Systems | Jul 12, 2023 | Decision MakingFairness | —Unverified | 0 |
| BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Jul 7, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| TGRL: An Algorithm for Teacher Guided Reinforcement Learning | Jul 6, 2023 | counterfactualDecision Making | —Unverified | 0 |
| Generative Flow Networks: a Markov Chain Perspective | Jul 4, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Jul 1, 2023 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Thompson sampling for improved exploration in GFlowNets | Jun 30, 2023 | Active LearningDecision Making | —Unverified | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| A General Framework for Sequential Decision-Making under Adaptivity Constraints | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Large Sequence Models for Sequential Decision-Making: A Survey | Jun 24, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| You Can Trade Your Experience in Distributed Multi-Agent Multi-Armed Bandits | Jun 19, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| IF2Net: Innately Forgetting-Free Networks for Continual Learning | Jun 18, 2023 | Continual LearningDecision Making | —Unverified | 0 |
| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations | Jun 13, 2023 | Decision MakingDisentanglement | CodeCode Available | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel | Jun 9, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Federated Linear Contextual Bandits with User-level Differential Privacy | Jun 8, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| AI-based Identification of Most Critical Cyberattacks in Industrial Systems | Jun 7, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |