
Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
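As a rough illustration of the description above, the following minimal sketch applies Thompson sampling to a Bernoulli bandit. It assumes a uniform Beta(1, 1) prior on each arm's success probability, so the "randomly drawn belief" is one sample per arm from its Beta posterior, and the chosen action is the arm whose sampled mean is largest. The function name `thompson_sampling`, the `true_probs` argument, and the simulated rewards are illustrative assumptions, not taken from any of the papers listed on this page.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Simulate Beta-Bernoulli Thompson sampling (illustrative sketch).

    true_probs: hypothetical success probability of each arm, used only
                to simulate rewards; the learner never reads it directly.
    Returns per-arm pull counts, successes, and failures.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms
    failures = [0] * n_arms
    pulls = [0] * n_arms

    for _ in range(n_rounds):
        # Randomly drawn belief: one sample of each arm's mean reward
        # from its Beta(successes + 1, failures + 1) posterior.
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Act greedily with respect to the sampled belief.
        arm = max(range(n_arms), key=samples.__getitem__)

        # Simulated Bernoulli reward for the chosen arm.
        reward = 1 if rng.random() < true_probs[arm] else 0

        # Posterior update: count the outcome for the pulled arm.
        pulls[arm] += 1
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1

    return pulls, successes, failures


if __name__ == "__main__":
    pulls, successes, failures = thompson_sampling([0.1, 0.5, 0.9], 2000)
    print("pulls per arm:", pulls)
```

Early on the posteriors are wide, so every arm has a reasonable chance of producing the largest sample (exploration). As evidence accumulates the posteriors narrow, and the arm with the highest estimated mean is selected most of the time (exploitation).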

Papers


Showing 11–20 of 655 papers

Title	Date	Tasks	Status	Hype	Score
Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes	Jul 2, 2020	Meta-Learning, Thompson Sampling	Code Available	1	5
Approximate Thompson Sampling via Epistemic Neural Networks	Feb 18, 2023	Thompson Sampling	Code Available	1	5
A Tutorial on Thompson Sampling	Jul 7, 2017	Active Learning, Product Recommendation	Code Available	1	5
Neural Thompson Sampling	Oct 2, 2020	Multi-Armed Bandits, Thompson Sampling	Code Available	1	5
Dynamic Slate Recommendation with Gated Recurrent Units and Thompson Sampling	Apr 30, 2021	Recommendation Systems, Thompson Sampling	Code Available	1	5
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search	Dec 28, 2023	Multi-Agent Path Finding, Thompson Sampling	Code Available	1	5
Batched Bayesian optimization by maximizing the probability of including the optimum	Oct 8, 2024	Bayesian Optimization, Diversity	Code Available	1	5
Bayesian Optimization over Permutation Spaces	Dec 2, 2021	Bayesian Optimization, Heuristic Search	Code Available	1	5
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits	Oct 7, 2021	Multi-Armed Bandits, Thompson Sampling	Code Available	1	5
Mercer Features for Efficient Combinatorial Bayesian Optimization	Dec 14, 2020	Bayesian Optimization, Thompson Sampling	Code Available	1	5



No leaderboard results yet.