| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Jul 1, 2023 | Computational EfficiencyDecision Making | —Unverified | 0 |
| Thompson sampling for improved exploration in GFlowNets | Jun 30, 2023 | Active LearningDecision Making | —Unverified | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A General Framework for Sequential Decision-Making under Adaptivity Constraints | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Large Sequence Models for Sequential Decision-Making: A Survey | Jun 24, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent | Jun 20, 2023 | Bayesian OptimizationDecision Making | CodeCode Available | 1 |
| You Can Trade Your Experience in Distributed Multi-Agent Multi-Armed Bandits | Jun 19, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| IF2Net: Innately Forgetting-Free Networks for Continual Learning | Jun 18, 2023 | Continual LearningDecision Making | —Unverified | 0 |
| Simplified Temporal Consistency Reinforcement Learning | Jun 15, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Skill Disentanglement for Imitation Learning from Suboptimal Demonstrations | Jun 13, 2023 | Decision MakingDisentanglement | CodeCode Available | 0 |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Jun 9, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel | Jun 9, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Federated Linear Contextual Bandits with User-level Differential Privacy | Jun 8, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Jun 7, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| AI-based Identification of Most Critical Cyberattacks in Industrial Systems | Jun 7, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion | Jun 7, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Jun 6, 2023 | Causal InferenceDecision Making | CodeCode Available | 0 |
| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| Learning Embeddings for Sequential Tasks Using Population of Agents | Jun 5, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Makingmodel | —Unverified | 0 |
| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 1 |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 2 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | May 27, 2023 | Adversarial AttackDecision Making | CodeCode Available | 0 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 |
| Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds | May 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | May 26, 2023 | Decision MakingLearning-To-Rank | —Unverified | 0 |
| Self-Supervised Reinforcement Learning that Transfers using Random Features | May 26, 2023 | Decision MakingModel Predictive Control | —Unverified | 0 |
| A Mini Review on the utilization of Reinforcement Learning with OPC UA | May 24, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing | May 17, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Optimizing Memory Mapping Using Deep Reinforcement Learning | May 11, 2023 | Cloud ComputingDecision Making | —Unverified | 0 |
| Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization | May 6, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Knowledge Transfer from Teachers to Learners in Growing-Batch Reinforcement Learning | May 5, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning | Apr 29, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| X-RLflow: Graph Reinforcement Learning for Neural Network Subgraphs Transformation | Apr 28, 2023 | Decision MakingGraph Neural Network | CodeCode Available | 1 |
| Distance Weighted Supervised Learning for Offline Interaction Data | Apr 26, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| TempoRL: laser pulse temporal shape optimization with Deep Reinforcement Learning | Apr 20, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making | Apr 20, 2023 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning | Apr 20, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Model Based Reinforcement Learning for Personalized Heparin Dosing | Apr 19, 2023 | Decision Makingmodel | —Unverified | 0 |
| AoI-Delay Tradeoff in Mobile Edge Caching: A Mixed-Order Drift-Plus-Penalty Algorithm | Apr 18, 2023 | Decision MakingScheduling | —Unverified | 0 |
| Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization | Apr 17, 2023 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Synthetically Generating Human-like Data for Sequential Decision Making Tasks via Reward-Shaped Imitation Learning | Apr 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Automaton-Guided Curriculum Generation for Reinforcement Learning Agents | Apr 11, 2023 | Decision MakingQ-Learning | CodeCode Available | 0 |
| Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring | Apr 2, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning | Apr 2, 2023 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |