Robust Dynamic Assortment Optimization in the Presence of Outlier Customers Oct 9, 2019 Assortment Optimization Thompson Sampling
— Unverified 00 Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments Jun 26, 2025 Reinforcement Learning (RL) Thompson Sampling
— Unverified 00 Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks Oct 25, 2024 Decision Making Sequential Decision Making
— Unverified 00 Safe Linear Leveling Bandits Dec 13, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Safe Linear Thompson Sampling with Side Information Nov 6, 2019 Thompson Sampling
— Unverified 00 Sample-based Dynamic Hierarchical Transformer with Layer and Head Flexibility via Contextual Bandit Dec 5, 2023 Thompson Sampling
— Unverified 00 The Price of Incentivizing Exploration: A Characterization via Thompson Sampling and Sample Complexity Feb 3, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Sampling Acquisition Functions for Batch Bayesian Optimization Mar 22, 2019 Bayesian Optimization Thompson Sampling
— Unverified 00 Satisficing in Time-Sensitive Bandit Learning Mar 7, 2018 Thompson Sampling
— Unverified 00 Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype May 22, 2025 Feature Engineering Large Language Model
— Unverified 00 Scalable Generalized Linear Bandits: Online Computation and Hashing Jun 1, 2017 Thompson Sampling
— Unverified 00 Scalable Neural Contextual Bandit for Recommender Systems Jun 26, 2023 Recommendation Systems Thompson Sampling
— Unverified 00 Scalable regret for learning to control network-coupled subsystems with unknown dynamics Aug 18, 2021 Thompson Sampling
— Unverified 00 Scalable Thompson Sampling using Sparse Gaussian Process Models Jun 9, 2020 Thompson Sampling
— Unverified 00 Scalable Thompson Sampling via Optimal Transport Feb 19, 2019 Decision Making Sequential Decision Making
— Unverified 00 Scaling Multi-Armed Bandit Algorithms Jul 25, 2019 Multi-Armed Bandits Sequential Decision Making
— Unverified 00 Screening for an Infectious Disease as a Problem in Stochastic Control Nov 1, 2020 Thompson Sampling
— Unverified 00 Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization May 17, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Sequential Best-Arm Identification with Application to Brain-Computer Interface May 17, 2023 Brain Computer Interface EEG
— Unverified 00 Sequential Matrix Completion Oct 23, 2017 Collaborative Filtering Matrix Completion
— Unverified 00 Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling Jun 4, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 00 Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms Apr 6, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Simple Bayesian Algorithms for Best Arm Identification Feb 26, 2016 Thompson Sampling
— Unverified 00 Simplifying Bayesian Optimization Via In-Context Direct Optimum Sampling May 29, 2025 Bayesian Optimization Thompson Sampling
— Unverified 00 Sliding-Window Thompson Sampling for Non-Stationary Settings Sep 8, 2024 Decision Making Sequential Decision Making
— Unverified 00 Smart Routing with Precise Link Estimation: DSEE-Based Anypath Routing for Reliable Wireless Networking May 16, 2024 Thompson Sampling
— Unverified 00 Solving Bernoulli Rank-One Bandits with Unimodal Thompson Sampling Dec 6, 2019 Thompson Sampling
— Unverified 00 Sparse Nonparametric Contextual Bandits Mar 20, 2025 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Sparse Spectrum Gaussian Process for Bayesian Optimization Jun 21, 2019 Bayesian Optimisation Bayesian Optimization
— Unverified 00 Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism Jun 6, 2024 Thompson Sampling
— Unverified 00 SPRT-based Efficient Best Arm Identification in Stochastic Bandits Jul 22, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Stable Thompson Sampling: Valid Inference via Variance Inflation May 29, 2025 Decision Making Thompson Sampling
— Unverified 00 Stage-wise Conservative Linear Bandits Sep 30, 2020 Form Thompson Sampling
— Unverified 00 Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits Jun 11, 2020 Thompson Sampling
— Unverified 00 Stochastically Constrained Best Arm Identification with Thompson Sampling Jan 7, 2025 Thompson Sampling
— Unverified 00 Stochastic Neural Network with Kronecker Flow Jun 10, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Streaming kernel regression with provably adaptive mean, variance, and regularization Aug 2, 2017 regression Thompson Sampling
— Unverified 00 Surrogate modeling for Bayesian optimization beyond a single Gaussian process May 27, 2022 Bayesian Optimization Drug Discovery
— Unverified 00 Synthetically Controlled Bandits Feb 14, 2022 Thompson Sampling
— Unverified 00 Taming Non-stationary Bandits: A Bayesian Approach Jul 31, 2017 Thompson Sampling
— Unverified 00 Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits Sep 18, 2023 Dialogue Act Classification Multi-Armed Bandits
— Unverified 00 Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 00 The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems Nov 5, 2017 Thompson Sampling
— Unverified 00 The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits Oct 14, 2016 reinforcement-learning Reinforcement Learning
— Unverified 00 The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle Nov 8, 2021 Combinatorial Optimization Open-Ended Question Answering
— Unverified 00 The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation Jun 4, 2019 Recommendation Systems Thompson Sampling
— Unverified 00 The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling Feb 16, 2021 Decision Making LEMMA
— Unverified 00 The Sliding Regret in Stochastic Bandits: Discriminating Index and Randomized Policies Nov 30, 2023 Thompson Sampling
— Unverified 00 The Typical Behavior of Bandit Algorithms Oct 11, 2022 Thompson Sampling
— Unverified 00 Thompson Exploration with Best Challenger Rule in Best Arm Identification Oct 1, 2023 Thompson Sampling
— Unverified 00