SOTAVerified

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
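The idea of acting greedily with respect to a randomly drawn belief can be made concrete with a minimal sketch of Thompson sampling for a Bernoulli bandit, where each arm keeps a Beta posterior over its unknown reward probability. The class name and the arm probabilities below are illustrative, not from the source.

```python
import random


class BernoulliThompsonSampler:
    """Thompson sampling for a Bernoulli multi-armed bandit (illustrative sketch).

    Each arm keeps a Beta(successes + 1, failures + 1) posterior over its
    unknown reward probability. At each step we draw one sample from every
    posterior and pull the arm whose sampled value is largest.
    """

    def __init__(self, n_arms):
        self.successes = [0] * n_arms
        self.failures = [0] * n_arms

    def select_arm(self):
        # Draw one sample per arm from its Beta posterior (the "randomly
        # drawn belief"), then act greedily with respect to that draw.
        samples = [
            random.betavariate(s + 1, f + 1)
            for s, f in zip(self.successes, self.failures)
        ]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, arm, reward):
        # Binary reward: 1 = success, 0 = failure.
        if reward:
            self.successes[arm] += 1
        else:
            self.failures[arm] += 1


# Hypothetical simulation: three arms with made-up success probabilities.
random.seed(0)
true_probs = [0.2, 0.5, 0.8]
sampler = BernoulliThompsonSampler(len(true_probs))
for _ in range(2000):
    arm = sampler.select_arm()
    sampler.update(arm, 1 if random.random() < true_probs[arm] else 0)

# The posterior concentrates on the best arm, which ends up pulled most often.
pulls = [s + f for s, f in zip(sampler.successes, sampler.failures)]
best_arm = max(range(len(pulls)), key=pulls.__getitem__)
```

Because exploration comes from posterior sampling rather than an explicit exploration bonus, arms that still look plausibly optimal keep getting occasional pulls, while clearly inferior arms are pulled less and less.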

Papers

Showing 521–530 of 655 papers

| Title | Status | Hype |
| --- | --- | --- |
| When and why randomised exploration works (in linear bandits) | | 0 |
| When Combinatorial Thompson Sampling meets Approximation Regret | | 0 |
| Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation | | 0 |
| Zero-Inflated Bandits | | 0 |
| A Bandit Approach to Online Pricing for Heterogeneous Edge Resource Allocation | | 0 |
| A Batched Multi-Armed Bandit Approach to News Headline Testing | | 0 |
| Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits | | 0 |
| A Bayesian Choice Model for Eliminating Feedback Loops | | 0 |
| Accelerating Grasp Exploration by Leveraging Learned Priors | | 0 |
| A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits | | 0 |
Page 53 of 66
