| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision Making, NetHack | Unverified | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs | Mar 29, 2023 | Active Learning, Decision Making | Code Available | 0 |
| Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | Mar 23, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs | Mar 18, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP | Mar 16, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning | Mar 15, 2023 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision Making, MuJoCo | Code Available | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision Making, Imitation Learning | Unverified | 0 |
| Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks | Mar 9, 2023 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards | Mar 9, 2023 | Decision Making, regression | Unverified | 0 |
| Automated Cyber Defence: A Review | Mar 8, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Exploration via Epistemic Value Estimation | Mar 7, 2023 | Decision Making, Efficient Exploration | Unverified | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Causal Explanations for Sequential Decision-Making in Multi-Agent Systems | Feb 21, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 0 |
| Minimax-Bayes Reinforcement Learning | Feb 21, 2023 | Decision Making, Decision Making Under Uncertainty | Code Available | 0 |
| Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems | Feb 20, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Best Arm Identification for Stochastic Rising Bandits | Feb 15, 2023 | Decision Making, Model Selection | Code Available | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision Making, Management | Unverified | 0 |
| Effective Dimension in Bandit Problems under Censorship | Feb 14, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Scalable Bayesian optimization with high-dimensional outputs using randomized prior networks | Feb 14, 2023 | Bayesian Optimization, Decision Making | Code Available | 0 |
| Statistical Complexity and Optimal Algorithms for Non-linear Ridge Bandits | Feb 12, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Survey on Causal Reinforcement Learning | Feb 10, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | Feb 9, 2023 | Decision Making, Representation Learning | Unverified | 0 |
| A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis | Feb 8, 2023 | Decision Making, Multi-Objective Reinforcement Learning | Unverified | 0 |
| Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications | Feb 7, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Strong Baseline for Batch Imitation Learning | Feb 6, 2023 | continuous-control, Continuous Control | Unverified | 0 |
| A Reduction-based Framework for Sequential Decision Making with Delayed Feedback | Feb 3, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Learning Universal Policies via Text-Guided Video Generation | Jan 31, 2023 | Decision Making, Image Generation | Unverified | 0 |
| Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule Propagation | Jan 30, 2023 | Decision Making, Graph Attention | Code Available | 0 |
| Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation | Jan 27, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Jan 24, 2023 | Decision Making, Imitation Learning | Unverified | 0 |
| Off-Policy Evaluation for Action-Dependent Non-Stationary Environments | Jan 24, 2023 | counterfactual, Counterfactual Reasoning | Code Available | 0 |
| Inducing Point Allocation for Sparse Gaussian Processes in High-Throughput Bayesian Optimisation | Jan 24, 2023 | Bayesian Optimisation, Decision Making | Unverified | 0 |
| The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making | Jan 21, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation | Jan 20, 2023 | Decision Making, Management | Unverified | 0 |
| Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning | Jan 20, 2023 | Decision Making, model | Code Available | 0 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits | Jan 19, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Neuro-Symbolic World Models for Adapting to Open World Novelty | Jan 16, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Neuro-symbolic Meta Reinforcement Learning for Trading | Jan 15, 2023 | Decision Making, Meta Reinforcement Learning | Unverified | 0 |
| Fairness and Sequential Decision Making: Limits, Lessons, and Opportunities | Jan 13, 2023 | Decision Making, Fairness | Unverified | 0 |
| Asynchronous training of quantum reinforcement learning | Jan 12, 2023 | Decision Making, Quantum Machine Learning | Unverified | 0 |
| Sequential Fair Resource Allocation under a Markov Decision Process Framework | Jan 10, 2023 | Decision Making, Fairness | Unverified | 0 |
| RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm | Jan 7, 2023 | Answer Selection, Decision Making | Unverified | 0 |
| Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization | Jan 5, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Local Differential Privacy for Sequential Decision Making in a Changing Environment | Jan 2, 2023 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | Dec 30, 2022 | Decision Making, Multi-Armed Bandits | Unverified | 0 |