SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 441–450 of 655 papers

Title	Date	Tasks	Status	Hype
On Isometry Robustness of Deep 3D Point Cloud Models under Adversarial Attacks	Feb 27, 2020	Thompson Sampling	CodeCode Available	1
Efficient exploration of zero-sum stochastic games	Feb 24, 2020	Efficient ExplorationThompson Sampling	—Unverified	0
On Thompson Sampling with Langevin Algorithms	Feb 23, 2020	Thompson Sampling	—Unverified	0
Residual Bootstrap Exploration for Bandit Algorithms	Feb 19, 2020	Computational EfficiencyMulti-Armed Bandits	—Unverified	0
A General Theory of the Stochastic Linear Bandit and Its Applications	Feb 12, 2020	Multi-Armed BanditsThompson Sampling	—Unverified	0
The Price of Incentivizing Exploration: A Characterization via Thompson Sampling and Sample Complexity	Feb 3, 2020	Multi-Armed BanditsThompson Sampling	—Unverified	0
Thompson Sampling Algorithms for Mean-Variance Bandits	Feb 1, 2020	Decision MakingThompson Sampling	CodeCode Available	0
Bayesian Quantile and Expectile Optimisation	Jan 12, 2020	Bayesian OptimisationGaussian Processes	—Unverified	0
On Thompson Sampling for Smoother-than-Lipschitz Bandits	Jan 8, 2020	reinforcement-learningReinforcement Learning	—Unverified	0
Making Sense of Reinforcement Learning and Probabilistic Inference	Jan 3, 2020	reinforcement-learningReinforcement Learning	—Unverified	0

Show:10 25 50

← PrevPage 45 of 66Next →

No leaderboard results yet.