| A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning | Sep 8, 2022 | Decision MakingEpidemiology | —Unverified | 0 |
| Sequential Information Design: Learning to Persuade in the Dark | Sep 8, 2022 | Decision MakingPersuasiveness | —Unverified | 0 |
| MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization | Sep 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | ClusteringDecision Making | CodeCode Available | 0 |
| JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Aug 28, 2022 | Action GenerationCommon Sense Reasoning | —Unverified | 0 |
| Entropy Regularization for Population Estimation | Aug 24, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Sampling Through the Lens of Sequential Decision Making | Aug 17, 2022 | Decision MakingInformation Retrieval | —Unverified | 0 |
| Streaming Adaptive Submodular Maximization | Aug 17, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits | Aug 11, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework | Aug 3, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | Jul 29, 2022 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | Jul 26, 2022 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Partial-Monotone Adaptive Submodular Maximization | Jul 26, 2022 | Active LearningDecision Making | —Unverified | 0 |
| Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations | Jul 25, 2022 | Decision MakingMeta-Learning | —Unverified | 0 |
| Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution | Jul 22, 2022 | Algorithmic Tradingcontinuous-control | —Unverified | 0 |
| High dimensional stochastic linear contextual bandit with missing covariates | Jul 22, 2022 | Decision MakingExperimental Design | —Unverified | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | AnatomyDecision Making | —Unverified | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models | Jul 17, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Hindsight Learning for MDPs with Exogenous Inputs | Jul 13, 2022 | counterfactualDecision Making | CodeCode Available | 0 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Scaling up ML-based Black-box Planning with Partial STRIPS Models | Jul 10, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts | Jul 9, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |