SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 571–580 of 655 papers

Title	Date	Tasks	Status
An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits	Sep 5, 2019	Decision MakingRecommendation Systems	—Unverified
An Efficient Algorithm For Generalized Linear Bandit: Online Stochastic Gradient Descent and Thompson Sampling	Jun 7, 2020	Thompson Sampling	—Unverified
An Empirical Evaluation of Thompson Sampling	Dec 1, 2011	Multi-Armed BanditsThompson Sampling	—Unverified
An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders	Aug 28, 2024	Recommendation SystemsThompson Sampling	—Unverified
An improved regret analysis for UCB-N and TS-N	May 6, 2023	LEMMAThompson Sampling	—Unverified
An Information-Theoretic Analysis for Thompson Sampling with Many Actions	May 30, 2018	Thompson Sampling	—Unverified
An Information-Theoretic Analysis of Thompson Sampling	Mar 21, 2014	Thompson Sampling	—Unverified
An Information-Theoretic Analysis of Thompson Sampling for Logistic Bandits	Dec 3, 2024	Thompson Sampling	—Unverified
An Information-Theoretic Analysis of Thompson Sampling with Infinite Action Spaces	Feb 4, 2025	Thompson Sampling	—Unverified
An Online Learning Framework for Energy-Efficient Navigation of Electric Vehicles	Mar 3, 2020	NavigateThompson Sampling	—Unverified

Show:10 25 50

← PrevPage 58 of 66Next →

No leaderboard results yet.