Odds-Ratio Thompson Sampling to Control for Time-Varying Effect Mar 4, 2020 Thompson Sampling
Code Code Available 05 Old Dog Learns New Tricks: Randomized UCB for Bandit Problems Oct 11, 2019 Thompson Sampling
Code Code Available 05 Multi-Agent Active Search using Realistic Depth-Aware Noise Model Nov 9, 2020 object-detection Object Detection
Code Code Available 05 Online Learning of Decision Trees with Thompson Sampling Apr 9, 2024 Interpretable Machine Learning Thompson Sampling
Code Code Available 05 Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays Jun 2, 2015 Thompson Sampling
Code Code Available 05 Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Jan 31, 2022 Bayesian Inference Multi-Armed Bandits
Code Code Available 05 Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models Jul 3, 2015 Atari Games reinforcement-learning
Code Code Available 05 Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning Approach Oct 26, 2022 Thompson Sampling Variational Inference
Code Code Available 05 Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs Dec 24, 2023 Computational Efficiency Thompson Sampling
Code Code Available 05 Information-Directed Exploration for Deep Reinforcement Learning Dec 18, 2018 Atari Games Deep Reinforcement Learning
Code Code Available 05 Modeling Human Exploration Through Resource-Rational Reinforcement Learning Jan 27, 2022 Meta-Learning reinforcement-learning
Code Code Available 05 Randomized Value Functions via Multiplicative Normalizing Flows Jun 6, 2018 Efficient Exploration Thompson Sampling
Code Code Available 05 Evaluating Deep Vs. Wide & Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations Jan 31, 2022 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Fast, Precise Thompson Sampling for Bayesian Optimization Nov 26, 2024 Bayesian Optimization STS
Code Code Available 05 Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards Apr 28, 2023 Multi-Armed Bandits Thompson Sampling
Code Code Available 05 Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Mar 24, 2022 Bayesian Optimization Decision Making
Code Code Available 05 Dynamic Assortment Selection and Pricing with Censored Preference Feedback Apr 3, 2025 Thompson Sampling
Code Code Available 05 Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning Jul 11, 2019 Thompson Sampling
Code Code Available 05 Double Thompson Sampling for Dueling Bandits Apr 25, 2016 Thompson Sampling
Code Code Available 05 Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling Feb 26, 2018 Decision Making Deep Reinforcement Learning
Code Code Available 05 Differentially Private Online Bayesian Estimation With Adaptive Truncation Jan 19, 2023 Privacy Preserving Sensitivity
Code Code Available 05 Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach Aug 21, 2023 Decision Making Multi-Armed Bandits
Code Code Available 05 Bandit-Based Prompt Design Strategy Selection Improves Prompt Optimizers Mar 3, 2025 Prompt Engineering Thompson Sampling
Code Code Available 05 RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Distributed Thompson sampling under constrained communication Oct 21, 2024 Bayesian Optimization Thompson Sampling
Code Code Available 05 Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints Jan 24, 2023 Thompson Sampling
Code Code Available 05 Efficient Exploration through Bayesian Deep Q-Networks Feb 13, 2018 Atari Games Efficient Exploration
Code Code Available 05 Cascading Bandits for Large-Scale Recommendation Problems Mar 17, 2016 Multi-Armed Bandits Recommendation Systems
Code Code Available 05 Addressing Missing Data Issue for Diffusion-based Recommendation May 18, 2025 Denoising Thompson Sampling
Code Code Available 05 ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision Medicine Nov 26, 2021 Thompson Sampling
Code Code Available 05 Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo Jan 22, 2024 Thompson Sampling
Code Code Available 05 Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling Apr 26, 2022 Decision Making Evolutionary Algorithms
Code Code Available 05 Causal Bandits for Linear Structural Equation Models Aug 26, 2022 Thompson Sampling
Code Code Available 05 FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling Jan 31, 2025 Federated Learning Thompson Sampling
Code Code Available 05 Mixed-Effect Thompson Sampling May 30, 2022 Thompson Sampling
Code Code Available 05 Improving Portfolio Optimization Results with Bandit Networks Oct 5, 2024 Portfolio Optimization Recommendation Systems
Code Code Available 05 Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems Feb 7, 2022 Decision Making Dimensionality Reduction
Code Code Available 05 Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling Sep 10, 2017 Reinforcement Learning Thompson Sampling
Code Code Available 05 Learning to Play Imperfect-Information Games by Imitating an Oracle Planner Dec 22, 2020 Thompson Sampling
Code Code Available 05 Machine Learning for Online Algorithm Selection under Censored Feedback Sep 13, 2021 BIG-bench Machine Learning Thompson Sampling
Code Code Available 05 Bayesian Optimization for Categorical and Category-Specific Continuous Inputs Nov 28, 2019 Bayesian Optimization BIG-bench Machine Learning
Code Code Available 05 MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation Dec 11, 2018 Information Retrieval Online Ranker Evaluation
Code Code Available 05 Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 05 Asynchronous ε-Greedy Bayesian Optimisation Oct 15, 2020 Bayesian Optimisation Thompson Sampling
Code Code Available 05 Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit Aug 8, 2024 Federated Learning Thompson Sampling
Code Code Available 05 Asynchronous Parallel Bayesian Optimisation via Thompson Sampling May 25, 2017 Bayesian Optimisation Thompson Sampling
Code Code Available 05 Atlas: Automate Online Service Configuration in Network Slicing Oct 30, 2022 Bayesian Optimization Safe Exploration
Code Code Available 05 Bandit Learning with Implicit Feedback Dec 1, 2018 Bayesian Inference Thompson Sampling
Code Code Available 05 Adaptive Interventions with User-Defined Goals for Health Behavior Change Nov 16, 2023 Thompson Sampling
Code Code Available 05 A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits Aug 25, 2021 Thompson Sampling
Code Code Available 05