SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 421–430 of 655 papers

Title	Date	Tasks	Status
Sequential Test for the Lowest Mean: From Thompson to Murphy Sampling	Jun 4, 2018	Reinforcement LearningReinforcement Learning (RL)	—Unverified
Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms	Apr 6, 2023	Multi-Armed BanditsThompson Sampling	—Unverified
Simple Bayesian Algorithms for Best Arm Identification	Feb 26, 2016	Thompson Sampling	—Unverified
Simplifying Bayesian Optimization Via In-Context Direct Optimum Sampling	May 29, 2025	Bayesian OptimizationThompson Sampling	—Unverified
Sliding-Window Thompson Sampling for Non-Stationary Settings	Sep 8, 2024	Decision MakingSequential Decision Making	—Unverified
Smart Routing with Precise Link Estimation: DSEE-Based Anypath Routing for Reliable Wireless Networking	May 16, 2024	Thompson Sampling	—Unverified
Solving Bernoulli Rank-One Bandits with Unimodal Thompson Sampling	Dec 6, 2019	Thompson Sampling	—Unverified
Sparse Nonparametric Contextual Bandits	Mar 20, 2025	Multi-Armed BanditsThompson Sampling	—Unverified
Sparse Spectrum Gaussian Process for Bayesian Optimization	Jun 21, 2019	Bayesian OptimisationBayesian Optimization	—Unverified
Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism	Jun 6, 2024	Thompson Sampling	—Unverified

Show:10 25 50

← PrevPage 43 of 66Next →

No leaderboard results yet.