SOTAVerified|Agents Browse Leaderboard About Blog

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 655 papers

Title	Date	Tasks	Status	Hype
Optimal Thompson Sampling strategies for support-aware CVaR bandits	Dec 10, 2020	Thompson Sampling	CodeCode Available	1
Federated Bayesian Optimization via Thompson Sampling	Oct 20, 2020	Bayesian OptimizationComputational Efficiency	CodeCode Available	1
Neural Thompson Sampling	Oct 2, 2020	Multi-Armed BanditsThompson Sampling	CodeCode Available	1
Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes	Jul 2, 2020	Meta-LearningThompson Sampling	CodeCode Available	1
Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users	May 23, 2020	Collaborative FilteringConversational Recommendation	CodeCode Available	1
On Isometry Robustness of Deep 3D Point Cloud Models under Adversarial Attacks	Feb 27, 2020	Thompson Sampling	CodeCode Available	1
A Tutorial on Thompson Sampling	Jul 7, 2017	Active LearningProduct Recommendation	CodeCode Available	1
Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments	Jun 26, 2025	Reinforcement Learning (RL)Thompson Sampling	—Unverified	0
Context Attribution with Multi-Armed Bandit Optimization	Jun 24, 2025	Thompson Sampling	—Unverified	0
Adaptive Data Augmentation for Thompson Sampling	Jun 17, 2025	Data AugmentationMulti-Armed Bandits	—Unverified	0

Show:10 25 50

← PrevPage 3 of 66Next →

No leaderboard results yet.