Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Causal Bandits for Linear Structural Equation Models Aug 26, 2022 Thompson Sampling
Code Code Available 0Thompson Sampling: An Asymptotically Optimal Finite Time Analysis May 18, 2012 3D Reconstruction Thompson Sampling
Code Code Available 0Scalable Exploration via Ensemble++ Jul 18, 2024 Computational Efficiency Decision Making
Code Code Available 0Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Apr 26, 2022 Decision Making Evolutionary Algorithms
Code Code Available 0Practical Bayesian Learning of Neural Networks via Adaptive Optimisation Methods Nov 8, 2018 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics Mar 11, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Adapting multi-armed bandits policies to contextual bandits scenarios Nov 11, 2018 Binary Classification Classification
Code Code Available 0Machine Learning for Online Algorithm Selection under Censored Feedback Sep 13, 2021 BIG-bench Machine Learning Thompson Sampling
Code Code Available 0Stacked Thompson Bandits Feb 28, 2017 Thompson Sampling
Code Code Available 0Modeling Human Exploration Through Resource-Rational Reinforcement Learning Jan 27, 2022 Meta-Learning reinforcement-learning
Code Code Available 0Online Learning of Decision Trees with Thompson Sampling Apr 9, 2024 Interpretable Machine Learning Thompson Sampling
Code Code Available 0Fast, Precise Thompson Sampling for Bayesian Optimization Nov 26, 2024 Bayesian Optimization STS
Code Code Available 0Vaccine allocation policy optimization and budget sharing mechanism using Thompson sampling Sep 21, 2021 Decision Making Management
Code Code Available 0Bayesian Algorithms for Decentralized Stochastic Bandits Oct 20, 2020 Thompson Sampling
Code Code Available 0FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling Jan 31, 2025 Federated Learning Thompson Sampling
Code Code Available 0Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning Jul 11, 2019 Thompson Sampling
Code Code Available 0State-Aware Variational Thompson Sampling for Deep Q-Networks Feb 7, 2021 Thompson Sampling
Code Code Available 0Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit Aug 8, 2024 Federated Learning Thompson Sampling
Code Code Available 0Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit May 7, 2024 Federated Learning Thompson Sampling
Code Code Available 0Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs Dec 24, 2023 Computational Efficiency Thompson Sampling
Code Code Available 0Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling May 10, 2019 Thompson Sampling
Code Code Available 0Adaptive Interventions with User-Defined Goals for Health Behavior Change Nov 16, 2023 Thompson Sampling
Code Code Available 0A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits Aug 25, 2021 Thompson Sampling
Code Code Available 0MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation Dec 11, 2018 Information Retrieval Online Ranker Evaluation
Code Code Available 0Queueing Matching Bandits with Preference Feedback Oct 14, 2024 Thompson Sampling
Code Code Available 0Scalable Bayesian Optimization Using Vecchia Approximations of Gaussian Processes Mar 2, 2022 Bayesian Optimization Gaussian Processes
Code Code Available 0On Provably Robust Meta-Bayesian Optimization Jun 14, 2022 Bayesian Optimization Meta-Learning
Code Code Available 0Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures Nov 22, 2019 Thompson Sampling
Code Code Available 0Bandit-Based Prompt Design Strategy Selection Improves Prompt Optimizers Mar 3, 2025 Prompt Engineering Thompson Sampling
Code Code Available 0Atlas: Automate Online Service Configuration in Network Slicing Oct 30, 2022 Bayesian Optimization Safe Exploration
Code Code Available 0Scalable Optimization for Wind Farm Control using Coordination Graphs Jan 19, 2021 Thompson Sampling
Code Code Available 0Variational inference for the multi-armed contextual bandit Sep 10, 2017 Multi-Armed Bandits Reinforcement Learning
Code Code Available 0Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 0Mixed-Effect Thompson Sampling May 30, 2022 Thompson Sampling
Code Code Available 0On the Suboptimality of Thompson Sampling in High Dimensions Feb 10, 2021 Thompson Sampling Vocal Bursts Intensity Prediction
Code Code Available 0Randomized Value Functions via Multiplicative Normalizing Flows Jun 6, 2018 Efficient Exploration Thompson Sampling
Code Code Available 0Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Ranking In Generalized Linear Bandits Jun 30, 2022 Diversity Multi-Armed Bandits
Code Code Available 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits Nov 11, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Using Adaptive Bandit Experiments to Increase and Investigate Engagement in Mental Health Oct 13, 2023 Thompson Sampling
Code Code Available 0Sub-sampling for Efficient Non-Parametric Bandit Exploration Oct 27, 2020 Thompson Sampling
Code Code Available 0Information-Directed Selection for Top-Two Algorithms May 24, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 0Thompson Sampling for a Fatigue-aware Online Recommendation System Jan 23, 2019 Thompson Sampling
Code Code Available 0Bayesian Optimization for Categorical and Category-Specific Continuous Inputs Nov 28, 2019 Bayesian Optimization BIG-bench Machine Learning
Code Code Available 0Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling Feb 26, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 0Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems May 29, 2019 Multi-Armed Bandits Thompson Sampling
Code Code Available 0More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling Jun 18, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Mostly Exploration-Free Algorithms for Contextual Bandits Apr 28, 2017 Diversity Multi-Armed Bandits
Code Code Available 0