| Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule Propagation | Jan 30, 2023 | Decision Making, Graph Attention | Code Available | 0 |
| Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation | Jan 27, 2023 | Decision Making, Sequential Decision Making | Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| Off-Policy Evaluation for Action-Dependent Non-Stationary Environments | Jan 24, 2023 | counterfactual, Counterfactual Reasoning | Code Available | 0 |
| SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Jan 24, 2023 | Decision Making, Imitation Learning | Unverified | 0 |
| Inducing Point Allocation for Sparse Gaussian Processes in High-Throughput Bayesian Optimisation | Jan 24, 2023 | Bayesian Optimisation, Decision Making | Unverified | 0 |
| The Conditional Cauchy-Schwarz Divergence with Applications to Time-Series Data and Sequential Decision Making | Jan 21, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation | Jan 20, 2023 | Decision Making, Management | Unverified | 0 |
| Differential Privacy in Cooperative Multiagent Planning | Jan 20, 2023 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning | Jan 20, 2023 | Decision Making, model | Code Available | 0 |