SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 341–350 of 655 papers

Title	Date	Tasks	Status
On Online Learning in Kernelized Markov Decision Processes	Nov 4, 2019	Thompson Sampling	—Unverified
On The Differential Privacy of Thompson Sampling With Gaussian Prior	Jun 24, 2018	Thompson Sampling	—Unverified
On the Importance of Uncertainty in Decision-Making with Large Language Models	Apr 3, 2024	Decision MakingMulti-Armed Bandits	—Unverified
On the Performance of Thompson Sampling on Logistic Bandits	May 12, 2019	Thompson Sampling	—Unverified
On the Prior Sensitivity of Thompson Sampling	Jun 10, 2015	SensitivityThompson Sampling	—Unverified
On Thompson Sampling for Smoother-than-Lipschitz Bandits	Jan 8, 2020	reinforcement-learningReinforcement Learning	—Unverified
On Thompson Sampling with Langevin Algorithms	Feb 23, 2020	Thompson Sampling	—Unverified
On Frequentist Regret of Linear Thompson Sampling	Jun 11, 2020	Thompson Sampling	—Unverified
Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment	Feb 16, 2021	Thompson Sampling	—Unverified
Optimal Exploration is no harder than Thompson Sampling	Oct 9, 2023	Thompson Sampling	—Unverified

Show:10 25 50

← PrevPage 35 of 66Next →

No leaderboard results yet.