| Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery | May 3, 2024 | Decision MakingInterpretable Machine Learning | —Unverified | 0 |
| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | May 2, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Apr 29, 2024 | Bayesian InferenceGaussian Processes | —Unverified | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision MakingNavigate | —Unverified | 0 |
| Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Apr 23, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| What Hides behind Unfairness? Exploring Dynamics Fairness in Reinforcement Learning | Apr 16, 2024 | Attributecounterfactual | CodeCode Available | 0 |
| Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | Apr 13, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Apr 10, 2024 | Decision MakingMeta Reinforcement Learning | CodeCode Available | 0 |
| Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control | Apr 10, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery | Apr 10, 2024 | Decision MakingImitation Learning | —Unverified | 0 |