| A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis | Feb 8, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications | Feb 7, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Strong Baseline for Batch Imitation Learning | Feb 6, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| A Reduction-based Framework for Sequential Decision Making with Delayed Feedback | Feb 3, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning Universal Policies via Text-Guided Video Generation | Jan 31, 2023 | Decision MakingImage Generation | —Unverified | 0 |
| Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule Propagation | Jan 30, 2023 | Decision MakingGraph Attention | CodeCode Available | 0 |
| Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation | Jan 27, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Jan 24, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Off-Policy Evaluation for Action-Dependent Non-Stationary Environments | Jan 24, 2023 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 |
| Inducing Point Allocation for Sparse Gaussian Processes in High-Throughput Bayesian Optimisation | Jan 24, 2023 | Bayesian OptimisationDecision Making | —Unverified | 0 |
| The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making | Jan 21, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation | Jan 20, 2023 | Decision MakingManagement | —Unverified | 0 |
| Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning | Jan 20, 2023 | Decision Makingmodel | CodeCode Available | 0 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits | Jan 19, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Neuro-Symbolic World Models for Adapting to Open World Novelty | Jan 16, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Neuro-symbolic Meta Reinforcement Learning for Trading | Jan 15, 2023 | Decision MakingMeta Reinforcement Learning | —Unverified | 0 |
| Fairness and Sequential Decision Making: Limits, Lessons, and Opportunities | Jan 13, 2023 | Decision MakingFairness | —Unverified | 0 |
| Asynchronous training of quantum reinforcement learning | Jan 12, 2023 | Decision MakingQuantum Machine Learning | —Unverified | 0 |
| Sequential Fair Resource Allocation under a Markov Decision Process Framework | Jan 10, 2023 | Decision MakingFairness | —Unverified | 0 |
| RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm | Jan 7, 2023 | Answer SelectionDecision Making | —Unverified | 0 |
| Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization | Jan 5, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Local Differential Privacy for Sequential Decision Making in a Changing Environment | Jan 2, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent | Dec 30, 2022 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |