SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–510 of 655 papers

Title	Date	Tasks	Status
Thompson Sampling with Virtual Helping Agents	Sep 16, 2022	Decision MakingSequential Decision Making	—Unverified
Time-Sensitive Bandit Learning and Satisficing Thompson Sampling	Apr 28, 2017	Thompson Sampling	—Unverified
Top Two Algorithms Revisited	Jun 13, 2022	Thompson SamplingVocal Bursts Valence Prediction	—Unverified
Towards Optimal Algorithms for Prediction with Expert Advice	Sep 10, 2014	PredictionThompson Sampling	—Unverified
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework	Feb 26, 2022	Meta-LearningThompson Sampling	—Unverified
Tree Ensembles for Contextual Bandits	Feb 10, 2024	Multi-Armed BanditsThompson Sampling	—Unverified
Truthful mechanisms for linear bandit games with private contexts	Jan 7, 2025	Thompson Sampling	—Unverified
TSEB: More Efficient Thompson Sampling for Policy Learning	Oct 10, 2015	Thompson Sampling	—Unverified
TSEC: a framework for online experimentation under experimental constraints	Jan 17, 2021	Portfolio OptimizationThompson Sampling	—Unverified
TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation	Jun 11, 2020	Multi-Armed BanditsThompson Sampling	—Unverified

Show:10 25 50

← PrevPage 51 of 66Next →

No leaderboard results yet.