SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 471–480 of 655 papers

Title	Date	Tasks	Status
Old Dog Learns New Tricks: Randomized UCB for Bandit Problems	Oct 11, 2019	Thompson Sampling	CodeCode Available
Robust Dynamic Assortment Optimization in the Presence of Outlier Customers	Oct 9, 2019	Assortment OptimizationThompson Sampling	—Unverified
A Quantile-based Approach for Hyperparameter Transfer Learning	Sep 30, 2019	Bayesian OptimizationHyperparameter Optimization	—Unverified
A Copula approach for hyperparameter transfer learning	Sep 25, 2019	Bayesian OptimizationThompson Sampling	—Unverified
Efficient Multivariate Bandit Algorithm with Path Planning	Sep 6, 2019	Heuristic SearchThompson Sampling	—Unverified
An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits	Sep 5, 2019	Decision MakingRecommendation Systems	—Unverified
Online Causal Inference for Advertising in Real-Time Bidding Auctions	Aug 22, 2019	Causal InferenceExperimental Design	—Unverified
A Batched Multi-Armed Bandit Approach to News Headline Testing	Aug 17, 2019	ArticlesThompson Sampling	—Unverified
A Bayesian Choice Model for Eliminating Feedback Loops	Aug 15, 2019	Recommendation SystemsThompson Sampling	—Unverified
Thompson Sampling with Approximate Inference	Aug 14, 2019	Decision MakingThompson Sampling	—Unverified

Show:10 25 50

← PrevPage 48 of 66Next →

No leaderboard results yet.