SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 281–290 of 655 papers

Title	Date	Tasks	Status
Partial Likelihood Thompson Sampling	Mar 2, 2022	Thompson Sampling	—Unverified
Towards Scalable and Robust Structured Bandits: A Meta-Learning Framework	Feb 26, 2022	Meta-LearningThompson Sampling	—Unverified
Thompson Sampling with Unrestricted Delays	Feb 24, 2022	Thompson Sampling	—Unverified
Double Thompson Sampling in Finite stochastic Games	Feb 21, 2022	Thompson Sampling	—Unverified
Adaptive Experimentation in the Presence of Exogenous Nonstationary Variation	Feb 18, 2022	Thompson Sampling	—Unverified
Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network	Feb 17, 2022	Bayesian InferenceGenerative Adversarial Network	—Unverified
Synthetically Controlled Bandits	Feb 14, 2022	Thompson Sampling	—Unverified
Remote Contextual Bandits	Feb 10, 2022	MarketingMulti-Armed Bandits	—Unverified
Fourier Representations for Black-Box Optimization over Categorical Variables	Feb 8, 2022	regressionThompson Sampling	—Unverified
Bayesian Non-stationary Linear Bandits for Large-Scale Recommender Systems	Feb 7, 2022	Decision MakingDimensionality Reduction	CodeCode Available

Show:10 25 50

← PrevPage 29 of 66Next →

No leaderboard results yet.