SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 441–450 of 655 papers

Title	Date	Tasks	Status
An Online Learning Framework for Energy-Efficient Navigation of Electric Vehicles	Mar 3, 2020	NavigateThompson Sampling	—Unverified
MOTS: Minimax Optimal Thompson Sampling	Mar 3, 2020	Thompson Sampling	—Unverified
Efficient exploration of zero-sum stochastic games	Feb 24, 2020	Efficient ExplorationThompson Sampling	—Unverified
On Thompson Sampling with Langevin Algorithms	Feb 23, 2020	Thompson Sampling	—Unverified
Residual Bootstrap Exploration for Bandit Algorithms	Feb 19, 2020	Computational EfficiencyMulti-Armed Bandits	—Unverified
A General Theory of the Stochastic Linear Bandit and Its Applications	Feb 12, 2020	Multi-Armed BanditsThompson Sampling	—Unverified
The Price of Incentivizing Exploration: A Characterization via Thompson Sampling and Sample Complexity	Feb 3, 2020	Multi-Armed BanditsThompson Sampling	—Unverified
Thompson Sampling Algorithms for Mean-Variance Bandits	Feb 1, 2020	Decision MakingThompson Sampling	CodeCode Available
Bayesian Quantile and Expectile Optimisation	Jan 12, 2020	Bayesian OptimisationGaussian Processes	—Unverified
On Thompson Sampling for Smoother-than-Lipschitz Bandits	Jan 8, 2020	reinforcement-learningReinforcement Learning	—Unverified

Show:10 25 50

← PrevPage 45 of 66Next →

No leaderboard results yet.