| HSVI can solve zero-sum Partially Observable Stochastic Games | Oct 26, 2022 | Decision Making, Heuristic Search | Unverified | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision Making, Q-Learning | Unverified | 0 |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Oct 25, 2022 | Decision Making, Experimental Design | Code Available | 0 |
| Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings | Oct 25, 2022 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | Oct 21, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Fine-Grained Session Recommendations in E-commerce using Deep Reinforcement Learning | Oct 20, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Movement Penalized Bayesian Optimization with Application to Wind Energy Systems | Oct 14, 2022 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Learning on the Edge: Online Learning with Stochastic Feedback Graphs | Oct 9, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop | Oct 7, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems | Oct 7, 2022 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Hyperbolic Deep Reinforcement Learning | Oct 4, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Continuous Monte Carlo Graph Search | Oct 4, 2022 | continuous-control, Continuous Control | Code Available | 0 |
| Generalizing Bayesian Optimization with Decision-theoretic Entropies | Oct 4, 2022 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision Making, Offline RL | Unverified | 0 |
| Modeling driver's evasive behavior during safety-critical lane changes: Two-dimensional time-to-collision and deep reinforcement learning | Sep 29, 2022 | Collision Avoidance, Decision Making | Unverified | 0 |
| Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making | Sep 29, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments | Sep 29, 2022 | Decision Making, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning with Non-Exponential Discounting | Sep 27, 2022 | Decision Making, Model-based Reinforcement Learning | Unverified | 0 |
| On Efficient Online Imitation Learning via Classification | Sep 26, 2022 | Classification, Decision Making | Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Mesh Refinement | Sep 25, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem | Sep 24, 2022 | Decision Making, Decision Making Under Uncertainty | Code Available | 0 |
| Graph Neural Networks for Multi-Robot Active Information Acquisition | Sep 24, 2022 | Decision Making, Imitation Learning | Unverified | 0 |
| SCALES: From Fairness Principles to Constrained Decision-Making | Sep 22, 2022 | Decision Making, Fairness | Code Available | 0 |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Sep 22, 2022 | Bayesian Inference, Bayesian Optimisation | Code Available | 0 |
| Thompson Sampling with Virtual Helping Agents | Sep 16, 2022 | Decision Making, Sequential Decision Making | Unverified | 0 |