SOTAVerified

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
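The description above can be illustrated with a minimal sketch, assuming Bernoulli (0/1) rewards and a uniform Beta(1, 1) prior on each arm. The names `thompson_step` and `run_bandit`, and the simulated `true_probs`, are hypothetical and chosen only for illustration. Each round draws one plausible success rate per arm from its posterior (the "randomly drawn belief") and plays the arm whose draw is largest.

```python
import random


def thompson_step(successes, failures, rng):
    """Sample one belief per arm from its Beta posterior and return the argmax."""
    # Beta(s + 1, f + 1) is the posterior under an assumed uniform Beta(1, 1) prior.
    samples = [rng.betavariate(s + 1, f + 1) for s, f in zip(successes, failures)]
    return max(range(len(samples)), key=samples.__getitem__)


def run_bandit(true_probs, n_rounds, seed=0):
    """Simulate a Bernoulli bandit; true_probs is an assumed, hidden ground truth."""
    rng = random.Random(seed)
    k = len(true_probs)
    successes = [0] * k
    failures = [0] * k
    pulls = [0] * k
    for _ in range(n_rounds):
        arm = thompson_step(successes, failures, rng)
        # Simulated environment: reward 1 with probability true_probs[arm].
        reward = 1 if rng.random() < true_probs[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1
        pulls[arm] += 1
    return pulls, successes, failures


if __name__ == "__main__":
    pulls, successes, failures = run_bandit([0.1, 0.9], n_rounds=2000)
    print("pulls per arm:", pulls)
```

Because the arm is chosen from a random posterior draw rather than the posterior mean, arms with few observations still get played occasionally (exploration), while arms with strong evidence of high reward are played most of the time (exploitation).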

Papers

Showing 476 to 500 of 655 papers

Title | Status | Hype
On the Performance of Thompson Sampling on Logistic Bandits | | 0
On the Prior Sensitivity of Thompson Sampling | | 0
On Thompson Sampling for Smoother-than-Lipschitz Bandits | | 0
On Thompson Sampling with Langevin Algorithms | | 0
On Frequentist Regret of Linear Thompson Sampling | | 0
Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment | | 0
Optimal Exploration is no harder than Thompson Sampling | | 0
Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits | | 0
Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks | | 0
Optimal No-regret Learning in Repeated First-price Auctions | | 0
Optimal Recommendation to Users that React: Online Learning for a Class of POMDPs | | 0
Optimistic posterior sampling for reinforcement learning: worst-case regret bounds | | 0
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games | | 0
Optimization of a SSP's Header Bidding Strategy using Thompson Sampling | | 0
Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification | | 0
Ordinal Bayesian Optimisation | | 0
Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space | | 0
Parallel Bayesian Optimization Using Satisficing Thompson Sampling for Time-Sensitive Black-Box Optimization | | 0
Parallel Contextual Bandits in Wireless Handover Optimization | | 0
Parallelizing Thompson Sampling | | 0
Partial Likelihood Thompson Sampling | | 0
Partially Observable Contextual Bandits with Linear Payoffs | | 0
Partially Observable Online Change Detection via Smooth-Sparse Decomposition | | 0
PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits | | 0
Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem | | 0
Page 20 of 27

No leaderboard results yet.