SOTAVerified

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.
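
The description above can be sketched for the simplest case. The following is a minimal illustration, assuming Bernoulli (0/1) rewards and an independent uniform Beta(1, 1) prior on each arm's success probability; neither assumption comes from this page. The names `thompson_step` and `run_bandit` are hypothetical, chosen only for this example. The "randomly drawn belief" is one sample per arm from its Beta posterior, and the chosen action is the arm whose sampled mean is largest.

```python
import random


def thompson_step(successes, failures, rng):
    """Choose one arm: draw a belief from each posterior, act greedily on it."""
    # Posterior for each arm is Beta(successes + 1, failures + 1)
    # under the assumed uniform Beta(1, 1) prior.
    samples = [
        rng.betavariate(s + 1, f + 1)
        for s, f in zip(successes, failures)
    ]
    # Pick the arm that maximizes expected reward under the sampled belief.
    return max(range(len(samples)), key=samples.__getitem__)


def run_bandit(true_means, n_rounds, seed=0):
    """Simulate a Bernoulli bandit and return the number of pulls per arm."""
    rng = random.Random(seed)
    k = len(true_means)
    successes = [0] * k
    failures = [0] * k
    for _ in range(n_rounds):
        arm = thompson_step(successes, failures, rng)
        # Simulated environment: reward is 1 with probability true_means[arm].
        if rng.random() < true_means[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return [s + f for s, f in zip(successes, failures)]


if __name__ == "__main__":
    # Illustrative means only; not taken from any paper listed here.
    print(run_bandit([0.1, 0.9], n_rounds=2000))
```

Because arms with little data have wide posteriors, they are occasionally sampled high and get explored; as evidence accumulates, the draws concentrate and the better arm is pulled most of the time.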

Papers

Showing 401–425 of 655 papers

Title | Status | Hype
Causal Bandits without prior knowledge using separating sets | | 0
Thompson Sampling for Unsupervised Sequential Selection | | 0
A Change-Detection Based Thompson Sampling Framework for Non-Stationary Bandits | | 0
Efficient Online Learning for Cognitive Radar-Cellular Coexistence via Contextual Thompson Sampling | | 0
Contextual Bandits for Advertising Budget Allocation | | 0
Near Optimal Adversarial Attacks on Stochastic Bandits and Defenses with Smoothed Responses | | 0
Reinforcement Learning with Trajectory Feedback | | 0
Lenient Regret for Multi-Armed Bandits | | 0
IntelligentPooling: Practical Thompson Sampling for mHealth | | 0
Greedy Bandits with Sampled Context | | 0
Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems | | 0
Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes | Code | 1
Variable Selection via Thompson Sampling | | 0
Policy Gradient Optimization of Thompson Sampling Policies | | 0
Asynchronous Multi Agent Active Search | | 0
Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect | | 0
Constrained Thompson Sampling for Real-Time Electricity Pricing with Grid Reliability Constraints | | 0
Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring | | 0
Latent Bandits Revisited | | 0
Hypermodels for Exploration | | 0
TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation | | 0
On Frequentist Regret of Linear Thompson Sampling | | 0
Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits | | 0
Scalable Thompson Sampling using Sparse Gaussian Process Models | | 0
Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization | | 0

No leaderboard results yet.