
Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
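As a rough illustration of the rule described above, the following sketch runs Thompson sampling on a simulated Bernoulli bandit. It is a minimal example under stated assumptions, not taken from any of the listed papers: rewards are 0 or 1, each arm starts with a uniform Beta(1, 1) prior, and the function name `thompson_sampling` and its parameters `true_probs`, `n_rounds`, and `seed` are hypothetical names chosen for illustration.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Simulate Beta-Bernoulli Thompson sampling; return pull counts per arm.

    true_probs is the (hypothetical) success probability of each arm,
    used only to simulate rewards and never shown to the learner.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms  # observed rewards of 1 per arm
    failures = [0] * n_arms   # observed rewards of 0 per arm
    pulls = [0] * n_arms

    for _ in range(n_rounds):
        # Randomly drawn belief: one sampled mean reward per arm,
        # drawn from that arm's Beta posterior (assumed Beta(1, 1) prior).
        sampled_means = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Choose the action that maximizes expected reward under that belief.
        arm = max(range(n_arms), key=lambda a: sampled_means[a])

        # Simulate the Bernoulli reward and update the chosen arm's posterior.
        reward = 1 if rng.random() < true_probs[arm] else 0
        successes[arm] += reward
        failures[arm] += 1 - reward
        pulls[arm] += 1

    return pulls
```

Calling `thompson_sampling([0.1, 0.5, 0.9], 2000)` returns a list of three pull counts. Because the posterior of a rarely pulled arm stays wide, such an arm still gets sampled occasionally (exploration), while arms with sharply concentrated high posteriors are chosen most often (exploitation).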

Papers


Showing 281–290 of 655 papers

Title	Date	Tasks	Status
Generator-Mediated Bandits: Thompson Sampling for GenAI-Powered Adaptive Interventions	May 22, 2025	Large Language Model, Thompson Sampling	Unverified
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits	Jun 26, 2023	Decision Making, Thompson Sampling	Unverified
Graph Neural Thompson Sampling	Jun 15, 2024	Decision Making, Graph Neural Network	Unverified
Feedback graph regret bounds for Thompson Sampling and UCB	May 23, 2019	Thompson Sampling	Unverified
Greedy Bandits with Sampled Context	Jul 27, 2020	Decision Making, Multi-Armed Bandits	Unverified
Greedy k-Center from Noisy Distance Samples	Nov 3, 2020	Thompson Sampling	Unverified
GuideBoot: Guided Bootstrap for Deep Contextual Bandits	Jul 18, 2021	Multi-Armed Bandits, Thompson Sampling	Unverified
GUTS: Generalized Uncertainty-Aware Thompson Sampling for Multi-Agent Active Search	Apr 4, 2023	All, Disaster Response	Unverified
gym-saturation: Gymnasium environments for saturation provers (System description)	Sep 16, 2023	OpenAI Gym, reinforcement-learning	Unverified
Bayesian Mixture Modelling and Inference based Thompson Sampling in Monte-Carlo Tree Search	Dec 1, 2013	Thompson Sampling	Unverified

