| A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning | Sep 8, 2022 | Decision Making, Epidemiology | Unverified | 0 |
| Sequential Information Design: Learning to Persuade in the Dark | Sep 8, 2022 | Decision Making, Persuasiveness | Unverified | 0 |
| MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization | Sep 1, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | Clustering, Decision Making | Code Available | 0 |
| JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Aug 28, 2022 | Action Generation, Common Sense Reasoning | Unverified | 0 |
| Entropy Regularization for Population Estimation | Aug 24, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Sampling Through the Lens of Sequential Decision Making | Aug 17, 2022 | Decision Making, Information Retrieval | Unverified | 0 |
| Streaming Adaptive Submodular Maximization | Aug 17, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits | Aug 11, 2022 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework | Aug 3, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | Jul 29, 2022 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | Jul 26, 2022 | Decision Making, Reinforcement Learning (RL) | Unverified | 0 |
| Partial-Monotone Adaptive Submodular Maximization | Jul 26, 2022 | Active Learning, Decision Making | Unverified | 0 |
| Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations | Jul 25, 2022 | Decision Making, Meta-Learning | Unverified | 0 |
| Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution | Jul 22, 2022 | Algorithmic Trading, continuous-control | Unverified | 0 |
| High dimensional stochastic linear contextual bandit with missing covariates | Jul 22, 2022 | Decision Making, Experimental Design | Unverified | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | Anatomy, Decision Making | Unverified | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models | Jul 17, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Hindsight Learning for MDPs with Exogenous Inputs | Jul 13, 2022 | counterfactual, Decision Making | Code Available | 0 |
| Contextual Bandits with Large Action Spaces: Made Practical | Jul 12, 2022 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Scaling up ML-based Black-box Planning with Partial STRIPS Models | Jul 10, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts | Jul 9, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Learning Optimal Solutions via an LSTM-Optimization Framework | Jul 6, 2022 | CPU, Decision Making | Unverified | 0 |
| Reinforcement Learning Approaches for the Orienteering Problem with Stochastic and Dynamic Release Dates | Jul 2, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning Based Dynamic Model Combination for Time Series Forecasting | Jun 28, 2022 | Decision Making, Ensemble Learning | Unverified | 0 |
| Utility Theory for Sequential Decision Making | Jun 27, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches | Jun 26, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Making, model | Unverified | 0 |
| Federated Learning with Uncertainty via Distilled Predictive Distributions | Jun 15, 2022 | Active Learning, Decision Making | Unverified | 0 |
| Interactively Learning Preference Constraints in Linear Bandits | Jun 10, 2022 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs | Jun 6, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | Jun 6, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning | Jun 4, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration | Jun 4, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | Jun 4, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| A Deep Reinforcement Learning Framework For Column Generation | Jun 3, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Adaptive Robust Online Portfolio Selection | Jun 2, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Jun 1, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Robust Anytime Learning of Markov Decision Processes | May 31, 2022 | Bayesian Inference, Decision Making | Code Available | 0 |
| Multi-Agent Learning of Numerical Methods for Hyperbolic PDEs with Factored Dec-MDP | May 31, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Adaptive Sampling for Discovery | May 30, 2022 | Decision Making, Drug Discovery | Unverified | 0 |
| Causal Explanations for Sequential Decision Making Under Uncertainty | May 30, 2022 | Causal Inference, Decision Making | Unverified | 0 |
| Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning | May 30, 2022 | Decision Making, Inductive Bias | Code Available | 0 |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games: Corrections | May 24, 2022 | counterfactual, Decision Making | Code Available | 0 |
| Flow-based Recurrent Belief State Learning for POMDPs | May 23, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Survey on Fair Reinforcement Learning: Theory and Practice | May 20, 2022 | Articles, Decision Making | Unverified | 0 |
| Marginal and Joint Cross-Entropies & Predictives for Online Bayesian Inference, Active Learning, and Active Sampling | May 18, 2022 | Active Learning, Bayesian Inference | Unverified | 0 |