Thompson Sampling Achieves O(T) Regret in Linear Quadratic Control Jun 17, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 00 Thompson Sampling with Approximate Inference Aug 14, 2019 Decision Making Thompson Sampling
— Unverified 00 Thompson Sampling and Approximate Inference Dec 1, 2019 Decision Making Thompson Sampling
— Unverified 00 Analysis of Thompson Sampling for Controlling Unknown Linear Diffusion Processes Jun 20, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 00 Thompson Sampling for 1-Dimensional Exponential Family Bandits Jul 12, 2013 Thompson Sampling
— Unverified 00 Thompson Sampling for Adversarial Bit Prediction Jun 21, 2019 Prediction Thompson Sampling
— Unverified 00 Thompson Sampling for Bandits with Clustered Arms Sep 6, 2021 Clustering Thompson Sampling
— Unverified 00 Thompson Sampling for Budgeted Multi-armed Bandits May 1, 2015 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Thompson Sampling Algorithms for Cascading Bandits Oct 2, 2018 Efficient Exploration Multi-Armed Bandits
— Unverified 00 Thompson Sampling for Combinatorial Network Optimization in Unknown Environments Jul 7, 2019 Combinatorial Optimization Thompson Sampling
— Unverified 00 Thompson Sampling for (Combinatorial) Pure Exploration Jun 18, 2022 Thompson Sampling
— Unverified 00 Thompson Sampling for Combinatorial Semi-Bandits Mar 13, 2018 Thompson Sampling
— Unverified 00 Thompson Sampling for Combinatorial Semi-bandits with Sleeping Arms and Long-Term Fairness Constraints May 14, 2020 Fairness Movie Recommendation
— Unverified 00 Thompson Sampling for Complex Bandit Problems Nov 3, 2013 Thompson Sampling
— Unverified 00 Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints Nov 2, 2019 Bayesian Optimization Decision Making
— Unverified 00 Thompson Sampling for Dynamic Pricing Feb 8, 2018 Active Learning Thompson Sampling
— Unverified 00 Thompson Sampling for Gaussian Entropic Risk Bandits May 14, 2021 Decision Making Thompson Sampling
— Unverified 00 Thompson sampling for improved exploration in GFlowNets Jun 30, 2023 Active Learning Decision Making
— Unverified 00 Thompson Sampling for Infinite-Horizon Discounted Decision Processes May 14, 2024 Thompson Sampling
— Unverified 00 Thompson Sampling for Learning Parameterized Markov Decision Processes Jun 29, 2014 Form reinforcement-learning
— Unverified 00 Thompson Sampling for Linear Bandit Problems with Normal-Gamma Priors Mar 6, 2023 Thompson Sampling
— Unverified 00 Thompson Sampling for Linear-Quadratic Control Problems Mar 27, 2017 Reinforcement Learning Thompson Sampling
— Unverified 00 Thompson sampling for linear quadratic mean-field teams Nov 9, 2020 Thompson Sampling
— Unverified 00 Thompson Sampling for Noncompliant Bandits Dec 3, 2018 Thompson Sampling
— Unverified 00 Thompson Sampling for Online Learning with Linear Experts Nov 3, 2013 Thompson Sampling
— Unverified 00 Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions May 13, 2023 Bayesian Inference Thompson Sampling
— Unverified 00 Thompson Sampling for Pursuit-Evasion Problems Nov 11, 2018 Thompson Sampling
— Unverified 00 Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit Aug 20, 2023 Thompson Sampling
— Unverified 00 Thompson Sampling For Stochastic Bandits with Graph Feedback Jan 16, 2017 Thompson Sampling
— Unverified 00 Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis Jan 21, 2024 Thompson Sampling
— Unverified 00 Thompson Sampling for the MNL-Bandit Jun 3, 2017 Thompson Sampling
— Unverified 00 Thompson Sampling for Unimodal Bandits Jun 15, 2021 Thompson Sampling
— Unverified 00 Thompson Sampling for Unsupervised Sequential Selection Sep 16, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study Nov 24, 2023 Decision Making Multi-Armed Bandits
— Unverified 00 Thompson Sampling Guided Stochastic Searching on the Line for Deceptive Environments with Applications to Root-Finding Problems Aug 5, 2017 Stochastic Optimization Thompson Sampling
— Unverified 00 Thompson Sampling in Dynamic Systems for Contextual Bandit Problems Oct 17, 2013 Thompson Sampling
— Unverified 00 Thompson Sampling in Non-Episodic Restless Bandits Oct 12, 2019 Open-Ended Question Answering Thompson Sampling
— Unverified 00 Thompson Sampling in Online RLHF with General Function Approximation May 29, 2025 Thompson Sampling
— Unverified 00 Thompson Sampling in Partially Observable Contextual Bandits Feb 15, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 00 Thompson Sampling is Asymptotically Optimal in General Environments Feb 25, 2016 reinforcement-learning Reinforcement Learning
— Unverified 00 Thompson Sampling Itself is Differentially Private Jul 20, 2024 Thompson Sampling
— Unverified 00 Thompson Sampling-like Algorithms for Stochastic Rising Bandits May 17, 2025 Model Selection Thompson Sampling
— Unverified 00 Thompson Sampling on Asymmetric α-Stable Bandits Mar 19, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 00 Thompson Sampling on Symmetric α-Stable Bandits Jul 8, 2019 Bayesian Inference Decision Making
— Unverified 00 Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards Apr 26, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Thompson Sampling under Bernoulli Rewards with Local Differential Privacy Jul 3, 2023 Thompson Sampling
— Unverified 00 Thompson Sampling with a Mixture Prior Jun 10, 2021 Decision Making Multi-Task Learning
— Unverified 00 Thompson Sampling with Diffusion Generative Prior Jan 12, 2023 Decision Making Denoising
— Unverified 00 Thompson sampling with the online bootstrap Oct 15, 2014 Thompson Sampling
— Unverified 00 Thompson Sampling with Unrestricted Delays Feb 24, 2022 Thompson Sampling
— Unverified 00