SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 491–500 of 655 papers

Title	Date	Tasks	Status
Stochastic Neural Network with Kronecker Flow	Jun 10, 2019	Multi-Armed BanditsThompson Sampling	—Unverified
The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation	Jun 4, 2019	Recommendation SystemsThompson Sampling	—Unverified
Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems	May 29, 2019	Multi-Armed BanditsThompson Sampling	CodeCode Available
Connections Between Mirror Descent, Thompson Sampling and the Information Ratio	May 28, 2019	Thompson Sampling	—Unverified
Feedback graph regret bounds for Thompson Sampling and UCB	May 23, 2019	Thompson Sampling	—Unverified
Adaptive Model Selection Framework: An Application to Airline Pricing	May 21, 2019	Model SelectionThompson Sampling	—Unverified
Adaptive Sensor Placement for Continuous Spaces	May 16, 2019	Thompson Sampling	—Unverified
On the Performance of Thompson Sampling on Logistic Bandits	May 12, 2019	Thompson Sampling	—Unverified
Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling	May 10, 2019	Thompson Sampling	CodeCode Available
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning	Apr 8, 2019	Bayesian OptimizationInductive Bias	—Unverified

Show:10 25 50

← PrevPage 50 of 66Next →

No leaderboard results yet.