
Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
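The idea above — sample a belief, then act greedily with respect to that sample — can be sketched for the classic Bernoulli bandit, where a Beta posterior over each arm's success probability admits a simple conjugate update. The function name and the simulated reward setup are illustrative, not from any paper listed below:

```python
import random

def thompson_sampling(true_probs, n_rounds, seed=0):
    """Beta-Bernoulli Thompson sampling on a simulated multi-armed bandit.

    Keeps a Beta(alpha, beta) posterior over each arm's unknown success
    probability. Each round, one sample is drawn from every posterior and
    the arm with the largest sample is played -- i.e. the action that
    maximizes expected reward under a randomly drawn belief.
    """
    rng = random.Random(seed)
    k = len(true_probs)
    alpha = [1.0] * k  # Beta prior: 1 + observed successes per arm
    beta = [1.0] * k   # Beta prior: 1 + observed failures per arm
    total_reward = 0
    for _ in range(n_rounds):
        # Draw one belief sample per arm from its current posterior ...
        samples = [rng.betavariate(alpha[i], beta[i]) for i in range(k)]
        # ... and act greedily with respect to that sampled belief.
        arm = max(range(k), key=lambda i: samples[i])
        # Simulated environment: Bernoulli reward with the arm's true rate.
        reward = 1 if rng.random() < true_probs[arm] else 0
        total_reward += reward
        # Conjugate Bayesian update of the chosen arm's posterior.
        alpha[arm] += reward
        beta[arm] += 1 - reward
    return total_reward, alpha, beta

reward, alpha, beta = thompson_sampling([0.2, 0.5, 0.8], n_rounds=2000)
```

Because arms with uncertain posteriors occasionally produce large samples, the algorithm keeps exploring them, while the posterior of the best arm concentrates and it is played increasingly often.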

Papers

Showing 11–20 of 655 papers

| Title | Status | Hype |
| --- | --- | --- |
| Neural Exploitation and Exploration of Contextual Bandits | Code | 1 |
| Approximate Thompson Sampling via Epistemic Neural Networks | Code | 1 |
| Sample-Then-Optimize Batch Neural Thompson Sampling | Code | 1 |
| Langevin Monte Carlo for Contextual Bandits | Code | 1 |
| Bayesian Optimization over Permutation Spaces | Code | 1 |
| EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits | Code | 1 |
| Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks | Code | 1 |
| Dynamic Slate Recommendation with Gated Recurrent Units and Thompson Sampling | Code | 1 |
| An empirical evaluation of active inference in multi-armed bandits | Code | 1 |
| Mercer Features for Efficient Combinatorial Bayesian Optimization | Code | 1 |
Page 2 of 66

No leaderboard results yet.