Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. At each step it draws a belief at random from the current posterior over the reward model and chooses the action that maximizes the expected reward under that sampled belief.
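
As a rough illustration of the description above, here is a minimal sketch for the Bernoulli bandit case. The Beta(1, 1) uniform prior, the function names thompson_step and run_bandit, and the simulated reward probabilities are all assumptions made for this example; they do not come from any paper listed on this page.

```python
import random


def thompson_step(successes, failures, rng):
    """Choose an arm by drawing one belief per arm and acting greedily on it."""
    # Assumption: uniform Beta(1, 1) prior, so the posterior for an arm with
    # s successes and f failures is Beta(1 + s, 1 + f).
    samples = [
        rng.betavariate(1 + s, 1 + f) for s, f in zip(successes, failures)
    ]
    # Pick the arm whose sampled mean reward is largest.
    return max(range(len(samples)), key=samples.__getitem__)


def run_bandit(true_means, rounds, seed=0):
    """Simulate Thompson sampling on a Bernoulli bandit (hypothetical setup)."""
    rng = random.Random(seed)
    successes = [0] * len(true_means)
    failures = [0] * len(true_means)
    for _ in range(rounds):
        arm = thompson_step(successes, failures, rng)
        # Simulated environment: reward is 1 with probability true_means[arm].
        if rng.random() < true_means[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures


if __name__ == "__main__":
    s, f = run_bandit([0.1, 0.5, 0.9], rounds=2000)
    print("pulls per arm:", [a + b for a, b in zip(s, f)])
```

Arms with few observations have wide posteriors, so they are still occasionally sampled as the best arm; this is where the exploration comes from, without a separate exploration parameter.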

Papers

Showing 521–530 of 655 papers

Title	Date	Tasks	Status
Adapting multi-armed bandits policies to contextual bandits scenarios	Nov 11, 2018	Binary Classification, Classification	Code Available
Practical Bayesian Learning of Neural Networks via Adaptive Optimisation Methods	Nov 8, 2018	Multi-Armed Bandits, Thompson Sampling	Code Available
A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting	Oct 18, 2018	Thompson Sampling	Unverified
Combining Bayesian Optimization and Lipschitz Optimization	Oct 10, 2018	Bayesian Optimization, global-optimization	Unverified
Contextual Multi-Armed Bandits for Causal Marketing	Oct 2, 2018	Causal Inference, counterfactual	Unverified
Thompson Sampling Algorithms for Cascading Bandits	Oct 2, 2018	Efficient Exploration, Multi-Armed Bandits	Unverified
Efficient Linear Bandits through Matrix Sketching	Sep 28, 2018	Thompson Sampling	Unverified
Incorporating Behavioral Constraints in Online AI Systems	Sep 15, 2018	Thompson Sampling	Unverified
Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms	Sep 7, 2018	Thompson Sampling	Unverified
Adaptive Grey-Box Fuzz-Testing with Thompson Sampling	Aug 24, 2018	Thompson Sampling	Unverified


No leaderboard results yet.