| Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations | Jul 25, 2022 | Decision Making, Meta-Learning | Unverified | 0 |
| High dimensional stochastic linear contextual bandit with missing covariates | Jul 22, 2022 | Decision Making, Experimental Design | Unverified | 0 |
| Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution | Jul 22, 2022 | Algorithmic Trading, continuous-control | Unverified | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | Anatomy, Decision Making | Unverified | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models | Jul 17, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Hindsight Learning for MDPs with Exogenous Inputs | Jul 13, 2022 | counterfactual, Decision Making | Code Available | 0 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Scaling up ML-based Black-box Planning with Partial STRIPS Models | Jul 10, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts | Jul 9, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling | Jul 9, 2022 | Bayesian Optimization, Decision Making | Code Available | 1 |
| Learning Optimal Solutions via an LSTM-Optimization Framework | Jul 6, 2022 | CPU, Decision Making | Unverified | 0 |
| Reinforcement Learning Approaches for the Orienteering Problem with Stochastic and Dynamic Release Dates | Jul 2, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning Based Dynamic Model Combination for Time Series Forecasting | Jun 28, 2022 | Decision Making, Ensemble Learning | Unverified | 0 |
| Utility Theory for Sequential Decision Making | Jun 27, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches | Jun 26, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Making, model | Unverified | 0 |
| Federated Learning with Uncertainty via Distilled Predictive Distributions | Jun 15, 2022 | Active Learning, Decision Making | Unverified | 0 |
| Interactively Learning Preference Constraints in Linear Bandits | Jun 10, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | Jun 6, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs | Jun 6, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning | Jun 4, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration | Jun 4, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | Jun 4, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |