| Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments | Sep 29, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Modeling driver's evasive behavior during safety-critical lane changes:Two-dimensional time-to-collision and deep reinforcement learning | Sep 29, 2022 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making | Sep 29, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning with Non-Exponential Discounting | Sep 27, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| On Efficient Online Imitation Learning via Classification | Sep 26, 2022 | ClassificationDecision Making | —Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Mesh Refinement | Sep 25, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem | Sep 24, 2022 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 0 |
| Graph Neural Networks for Multi-Robot Active Information Acquisition | Sep 24, 2022 | Decision MakingImitation Learning | —Unverified | 0 |
| SCALES: From Fairness Principles to Constrained Decision-Making | Sep 22, 2022 | Decision MakingFairness | CodeCode Available | 0 |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Sep 22, 2022 | Bayesian InferenceBayesian Optimisation | CodeCode Available | 0 |
| Thompson Sampling with Virtual Helping Agents | Sep 16, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning | Sep 8, 2022 | Decision MakingEpidemiology | —Unverified | 0 |
| Sequential Information Design: Learning to Persuade in the Dark | Sep 8, 2022 | Decision MakingPersuasiveness | —Unverified | 0 |
| MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization | Sep 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Federated Online Clustering of Bandits | Aug 31, 2022 | ClusteringDecision Making | CodeCode Available | 0 |
| JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Aug 28, 2022 | Action GenerationCommon Sense Reasoning | —Unverified | 0 |
| Entropy Regularization for Population Estimation | Aug 24, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Sampling Through the Lens of Sequential Decision Making | Aug 17, 2022 | Decision MakingInformation Retrieval | —Unverified | 0 |
| Streaming Adaptive Submodular Maximization | Aug 17, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits | Aug 11, 2022 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework | Aug 3, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | Jul 29, 2022 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning | Jul 26, 2022 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Partial-Monotone Adaptive Submodular Maximization | Jul 26, 2022 | Active LearningDecision Making | —Unverified | 0 |