New Insights into Bootstrapping for Bandits | May 24, 2018 | Thompson Sampling
No Algorithmic Collusion in Two-Player Blindfolded Game with Thompson Sampling | May 23, 2024 | Thompson Sampling
Nonparametric General Reinforcement Learning | Nov 28, 2016 | General Reinforcement Learning, reinforcement-learning
Non-Stationary Bandit Learning via Predictive Sampling | May 4, 2022 | Attribute, Thompson Sampling
Non-Stationary Dynamic Pricing Via Actor-Critic Information-Directed Pricing | Aug 19, 2022 | Thompson Sampling
Non-Stationary Latent Bandits | Dec 1, 2020 | Recommendation Systems, Thompson Sampling
No Regrets for Learning the Prior in Bandits | Jul 13, 2021 | Thompson Sampling
Observation-Free Attacks on Stochastic Bandits | Dec 1, 2021 | Thompson Sampling
On Adaptive Estimation for Dynamic Bernoulli Bandits | Dec 8, 2017 | Thompson Sampling
On Batch Bayesian Optimization | Nov 4, 2019 | Bayesian Optimization, Thompson Sampling
On Dynamic Pricing with Covariates | Dec 25, 2021 | Thompson Sampling
On Efficiency in Hierarchical Reinforcement Learning | Dec 1, 2020 | Computational Efficiency, Decision Making
On Improved Regret Bounds In Bayesian Optimization with Gaussian Noise | Dec 25, 2024 | Bayesian Optimization, Thompson Sampling
On Kernelized Multi-Armed Bandits with Constraints | Mar 29, 2022 | Multi-Armed Bandits, Thompson Sampling
On learning Whittle index policy for restless bandits with scalable regret | Feb 7, 2022 | Scheduling, Thompson Sampling
Online Algorithms For Parameter Mean And Variance Estimation In Dynamic Regression Models | May 18, 2016 | parameter estimation, regression
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits | Feb 18, 2023 | Hyperparameter Optimization, Multi-Armed Bandits
Online Causal Inference for Advertising in Real-Time Bidding Auctions | Aug 22, 2019 | Causal Inference, Experimental Design
Online Learning and Distributed Control for Residential Demand Response | Oct 11, 2020 | Stochastic Optimization, Thompson Sampling
Online Learning-based Waveform Selection for Improved Vehicle Recognition in Automotive Radar | Dec 1, 2022 | reinforcement-learning, Reinforcement Learning (RL)
Online Learning of Energy Consumption for Navigation of Electric Vehicles | Nov 3, 2021 | Navigate, Thompson Sampling
Online Learning of Network Bottlenecks via Minimax Paths | Sep 17, 2021 | Thompson Sampling
Online Residential Demand Response via Contextual Multi-Armed Bandits | Mar 7, 2020 | Decision Making, Multi-Armed Bandits
Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling | Mar 16, 2023 | Multi-Armed Bandits, Thompson Sampling
On Multi-Armed Bandit Designs for Dose-Finding Clinical Trials | Mar 17, 2019 | Thompson Sampling
On Online Learning in Kernelized Markov Decision Processes | Nov 4, 2019 | Thompson Sampling
On The Differential Privacy of Thompson Sampling With Gaussian Prior | Jun 24, 2018 | Thompson Sampling
On the Importance of Uncertainty in Decision-Making with Large Language Models | Apr 3, 2024 | Decision Making, Multi-Armed Bandits
On the Performance of Thompson Sampling on Logistic Bandits | May 12, 2019 | Thompson Sampling
On the Prior Sensitivity of Thompson Sampling | Jun 10, 2015 | Sensitivity, Thompson Sampling
On Thompson Sampling for Smoother-than-Lipschitz Bandits | Jan 8, 2020 | reinforcement-learning, Reinforcement Learning
On Thompson Sampling with Langevin Algorithms | Feb 23, 2020 | Thompson Sampling
On Frequentist Regret of Linear Thompson Sampling | Jun 11, 2020 | Thompson Sampling
Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment | Feb 16, 2021 | Thompson Sampling
Optimal Exploration is no harder than Thompson Sampling | Oct 9, 2023 | Thompson Sampling
Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits | Feb 3, 2023 | Thompson Sampling
Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks | Nov 27, 2018 | Thompson Sampling
Optimal No-regret Learning in Repeated First-price Auctions | Mar 22, 2020 | Multi-Armed Bandits, Thompson Sampling
Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs | Mar 30, 2016 | Recommendation Systems, Reinforcement Learning
Optimistic posterior sampling for reinforcement learning: worst-case regret bounds | Dec 1, 2017 | reinforcement-learning, Reinforcement Learning
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games | Feb 7, 2024 | Decision Making, Thompson Sampling
Optimization of a SSP's Header Bidding Strategy using Thompson Sampling | Jul 9, 2018 | Thompson Sampling
Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification | Feb 16, 2024 | Thompson Sampling
Ordinal Bayesian Optimisation | Dec 5, 2019 | Bayesian Optimisation, Thompson Sampling
Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space | Jun 6, 2017 | Bayesian Optimization, Thompson Sampling
Parallel Bayesian Optimization Using Satisficing Thompson Sampling for Time-Sensitive Black-Box Optimization | Oct 19, 2023 | Bayesian Optimization, STS
Parallel Contextual Bandits in Wireless Handover Optimization | Jan 21, 2019 | Multi-Armed Bandits, Thompson Sampling
Parallelizing Thompson Sampling | Jun 2, 2021 | Decision Making, Thompson Sampling
Partial Likelihood Thompson Sampling | Mar 2, 2022 | Thompson Sampling
Partially Observable Contextual Bandits with Linear Payoffs | Sep 17, 2024 | Decision Making, Multi-Armed Bandits