SOTAVerified

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Showing 601625 of 655 papers

TitleStatusHype
The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models0
Asymptotic Convergence of Thompson Sampling0
Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits0
Asynchronous Multi Agent Active Search0
Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems0
A Unified and Efficient Coordinating Framework for Autonomous DBMS Tuning0
Automatic Ensemble Learning for Online Influence Maximization0
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning0
Bag of Policies for Distributional Deep Exploration0
BanditCAT and AutoIRT: Machine Learning Approaches to Computerized Adaptive Testing and Item Calibration0
Bandit Change-Point Detection for Real-Time Monitoring High-Dimensional Data Under Sampling Control0
Bandit Convex Optimization: sqrtT Regret in One Dimension0
Bandit Learning for Diversified Interactive Recommendation0
Bandit Models of Human Behavior: Reward Processing in Mental Disorders0
Bandit Policies for Reliable Cellular Network Handovers in Extreme Mobility0
Bandits Under The Influence (Extended Version)0
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization0
Batch Bayesian Optimization for Replicable Experimental Design0
Batched Thompson Sampling0
Batched Thompson Sampling for Multi-Armed Bandits0
Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits0
Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies0
Bayesian Collaborative Bandits with Thompson Sampling for Improved Outreach in Maternal Health Program0
Bayesian decision-making under misspecified priors with applications to meta-learning0
Bayesian-Guided Generation of Synthetic Microbiomes with Minimized Pathogenicity0
Show:102550
← PrevPage 25 of 27Next →

No leaderboard results yet.