SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 571–580 of 655 papers

Title	Date	Tasks	Status
Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs	Dec 24, 2023	Computational EfficiencyThompson Sampling	CodeCode Available
Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling	May 10, 2019	Thompson Sampling	CodeCode Available
Adaptive Interventions with User-Defined Goals for Health Behavior Change	Nov 16, 2023	Thompson Sampling	CodeCode Available
A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits	Aug 25, 2021	Thompson Sampling	CodeCode Available
MergeDTS: A Method for Effective Large-Scale Online Ranker Evaluation	Dec 11, 2018	Information RetrievalOnline Ranker Evaluation	CodeCode Available
Queueing Matching Bandits with Preference Feedback	Oct 14, 2024	Thompson Sampling	CodeCode Available
Scalable Bayesian Optimization Using Vecchia Approximations of Gaussian Processes	Mar 2, 2022	Bayesian OptimizationGaussian Processes	CodeCode Available
On Provably Robust Meta-Bayesian Optimization	Jun 14, 2022	Bayesian OptimizationMeta-Learning	CodeCode Available
Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures	Nov 22, 2019	Thompson Sampling	CodeCode Available
Bandit-Based Prompt Design Strategy Selection Improves Prompt Optimizers	Mar 3, 2025	Prompt EngineeringThompson Sampling	CodeCode Available

Show:10 25 50

← PrevPage 58 of 66Next →

No leaderboard results yet.