Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo Jan 22, 2024 Thompson Sampling
Thompson Sampling for Bandit Learning in Matching Markets Apr 26, 2022 Multi-Armed Bandits Thompson Sampling
Differentially Private Online Bayesian Estimation With Adaptive Truncation Jan 19, 2023 Privacy Preserving Sensitivity
Multi-Agent Active Search using Realistic Depth-Aware Noise Model Nov 9, 2020 object-detection Object Detection
Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays Jun 2, 2015 Thompson Sampling
Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking Mar 24, 2022 Bayesian Optimization Decision Making
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework Jan 31, 2022 Bayesian Inference Multi-Armed Bandits
Improving Portfolio Optimization Results with Bandit Networks Oct 5, 2024 Portfolio Optimization Recommendation Systems
Thompson Sampling for Robust Transfer in Multi-Task Bandits Jun 17, 2022 Multi-Task Learning Thompson Sampling
Sequential Monte Carlo Bandits Aug 8, 2018 Decision Making Sequential Decision Making
Distributed Thompson sampling under constrained communication Oct 21, 2024 Bayesian Optimization Thompson Sampling
Thompson Sampling via Local Uncertainty Oct 30, 2019 Decision Making Multi-Armed Bandits
Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming May 25, 2018 Bayesian Inference Multi-Armed Bandits
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages Jun 2, 2023 Bayesian Inference continuous-control
Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints Jan 24, 2023 Thompson Sampling
Double Thompson Sampling for Dueling Bandits Apr 25, 2016 Thompson Sampling
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models Jul 3, 2015 Atari Games reinforcement-learning
Randomized Exploration for Non-Stationary Stochastic Linear Bandits Dec 11, 2019 Computational Efficiency Thompson Sampling
Neural Bandits for Data Mining: Searching for Dangerous Polypharmacy Dec 10, 2022 Thompson Sampling
Optimizing Conditional Value-At-Risk of Black-Box Functions Dec 1, 2021 Bayesian Optimization Thompson Sampling
Optimizing Pessimism in Dynamic Treatment Regimes: A Bayesian Learning Approach Oct 26, 2022 Thompson Sampling Variational Inference
Asynchronous Parallel Bayesian Optimisation via Thompson Sampling May 25, 2017 Bayesian Optimisation Thompson Sampling
Dynamic Assortment Selection and Pricing with Censored Preference Feedback Apr 3, 2025 Thompson Sampling
Addressing Missing Data Issue for Diffusion-based Recommendation May 18, 2025 Denoising Thompson Sampling
Asynchronous ε-Greedy Bayesian Optimisation Oct 15, 2020 Bayesian Optimisation Thompson Sampling
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems Feb 7, 2022 Decision Making Dimensionality Reduction
Bayesian bandits: balancing the exploration-exploitation tradeoff via double sampling Sep 10, 2017 Reinforcement Learning Thompson Sampling
Information-Directed Exploration for Deep Reinforcement Learning Dec 18, 2018 Atari Games Deep Reinforcement Learning
VITS : Variational Inference Thompson Sampling for contextual bandits Jul 19, 2023 Multi-Armed Bandits Thompson Sampling
Representative Action Selection for Large Action-Space Meta-Bandits May 23, 2025 Thompson Sampling
Nonparametric Gaussian Mixture Models for the Multi-Armed Bandit Aug 8, 2018 Density Estimation Multi-Armed Bandits
Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox Oct 7, 2024 Thompson Sampling
Efficient Exploration through Bayesian Deep Q-Networks Feb 13, 2018 Atari Games Efficient Exploration
Show Me the Whole World: Towards Entire Item Space Exploration for Interactive Personalized Recommendations Oct 19, 2021 Decision Making Model Selection
Thompson Sampling for Linearly Constrained Bandits Apr 20, 2020 Multi-Armed Bandits Thompson Sampling
Simple Modification of the Upper Confidence Bound Algorithm by Generalized Weighted Averages Aug 28, 2023 Decision Making Decision Making Under Uncertainty
Tsetlin Machine for Solving Contextual Bandit Problems Feb 4, 2022 Thompson Sampling
Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards Apr 28, 2023 Multi-Armed Bandits Thompson Sampling
Bandit Learning with Implicit Feedback Dec 1, 2018 Bayesian Inference Thompson Sampling
Automated Creative Optimization for E-Commerce Advertising Feb 28, 2021 AutoML Click-Through Rate Prediction
Thompson Sampling with Information Relaxation Penalties Feb 12, 2019 Thompson Sampling
Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure Mar 2, 2021 Efficient Exploration Thompson Sampling
Odds-Ratio Thompson Sampling to Control for Time-Varying Effect Mar 4, 2020 Thompson Sampling
Old Dog Learns New Tricks: Randomized UCB for Bandit Problems Oct 11, 2019 Thompson Sampling
Thompson Sampling for Multinomial Logit Contextual Bandits Dec 1, 2019 Multi-Armed Bandits Thompson Sampling
Trajectory-oriented optimization of stochastic epidemiological models May 6, 2023 Thompson Sampling
On Bits and Bandits: Quantifying the Regret-Information Trade-off May 26, 2024 Decision Making Question Answering
Learning to Play Imperfect-Information Games by Imitating an Oracle Planner Dec 22, 2020 Thompson Sampling
Process-constrained batch Bayesian approaches for yield optimization in multi-reactor systems Aug 5, 2024 Bayesian Optimization Thompson Sampling
ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision Medicine Nov 26, 2021 Thompson Sampling