SOTAVerified|Agents Browse Leaderboard About

Thompson Sampling

Thompson sampling, named after William R. Thompson, is a heuristic for choosing actions that addresses the exploration-exploitation dilemma in the multi-armed bandit problem. It consists of choosing the action that maximizes the expected reward with respect to a randomly drawn belief.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 561–570 of 655 papers

Title	Date	Tasks	Status
Modeling Human Exploration Through Resource-Rational Reinforcement Learning	Jan 27, 2022	Meta-Learningreinforcement-learning	CodeCode Available
Online Learning of Decision Trees with Thompson Sampling	Apr 9, 2024	Interpretable Machine LearningThompson Sampling	CodeCode Available
Fast, Precise Thompson Sampling for Bayesian Optimization	Nov 26, 2024	Bayesian OptimizationSTS	CodeCode Available
Vaccine allocation policy optimization and budget sharing mechanism using Thompson sampling	Sep 21, 2021	Decision MakingManagement	CodeCode Available
Bayesian Algorithms for Decentralized Stochastic Bandits	Oct 20, 2020	Thompson Sampling	CodeCode Available
FedRTS: Federated Robust Pruning via Combinatorial Thompson Sampling	Jan 31, 2025	Federated LearningThompson Sampling	CodeCode Available
Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning	Jul 11, 2019	Thompson Sampling	CodeCode Available
State-Aware Variational Thompson Sampling for Deep Q-Networks	Feb 7, 2021	Thompson Sampling	CodeCode Available
Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit	Aug 8, 2024	Federated LearningThompson Sampling	CodeCode Available
Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit	May 7, 2024	Federated LearningThompson Sampling	CodeCode Available

Show:10 25 50

← PrevPage 57 of 66Next →

No leaderboard results yet.