
Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
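The procedure described above can be sketched for the Bernoulli bandit case: each arm keeps a Beta posterior over its reward probability, one sample is drawn from each posterior, and the arm with the largest sample is played. This is a minimal illustration, not a reference implementation; the function names and the uniform Beta(1, 1) prior are choices made for this sketch.

```python
import random

def thompson_step(successes, failures):
    """Pick an arm by drawing one sample from each arm's Beta
    posterior and choosing the arm with the largest draw."""
    samples = [random.betavariate(s + 1, f + 1)  # Beta(1, 1) uniform prior
               for s, f in zip(successes, failures)]
    return max(range(len(samples)), key=samples.__getitem__)

def run_bandit(true_probs, horizon=2000, seed=0):
    """Simulate a Bernoulli bandit under Thompson sampling;
    returns the number of pulls of each arm."""
    random.seed(seed)
    k = len(true_probs)
    successes, failures = [0] * k, [0] * k
    for _ in range(horizon):
        arm = thompson_step(successes, failures)
        if random.random() < true_probs[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return [s + f for s, f in zip(successes, failures)]

if __name__ == "__main__":
    # Over time, pulls concentrate on the best arm (here, p = 0.7).
    print(run_bandit([0.3, 0.5, 0.7]))
```

Because the chosen arm maximizes a *sampled* belief rather than the posterior mean, arms with uncertain estimates are still played occasionally, which is how the heuristic balances exploration against exploitation.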

Papers

Showing 181–190 of 655 papers

| Title | Status | Hype |
| --- | --- | --- |
| Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo | Code | 1 |
| Practical Batch Bayesian Sampling Algorithms for Online Adaptive Traffic Experimentation | | 0 |
| Discounted Thompson Sampling for Non-Stationary Bandit Problems | | 0 |
| Sequential Best-Arm Identification with Application to Brain-Computer Interface | | 0 |
| Thompson Sampling for Parameterized Markov Decision Processes with Uninformative Actions | | 0 |
| Trajectory-oriented optimization of stochastic epidemiological models | Code | 0 |
| An improved regret analysis for UCB-N and TS-N | | 0 |
| Neural Exploitation and Exploration of Contextual Bandits | Code | 1 |
| Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards | Code | 0 |
| Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards | | 0 |
Page 19 of 66
