SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–655 of 655 papers

Title	Date	Tasks	Status
Exploiting correlation and budget constraints in Bayesian multi-armed bandit optimization	Mar 27, 2013	Bayesian OptimizationThompson Sampling	—Unverified
Learning to Optimize Via Posterior Sampling	Jan 11, 2013	Thompson Sampling	—Unverified
Thompson Sampling for Contextual Bandits with Linear Payoffs	Sep 15, 2012	Multi-Armed BanditsThompson Sampling	CodeCode Available
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis	May 18, 2012	3D ReconstructionThompson Sampling	CodeCode Available
An Empirical Evaluation of Thompson Sampling	Dec 1, 2011	Multi-Armed BanditsThompson Sampling	—Unverified

Show:10 25 50

← PrevPage 14 of 14Next →

No leaderboard results yet.