SOTAVerified

Multi-Armed Bandits

Multi-armed bandits refer to a class of tasks in which a fixed, limited amount of resources must be allocated among competing choices in a way that maximizes expected gain. These problems typically involve an exploration/exploitation trade-off.
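The exploration/exploitation trade-off can be illustrated with a minimal epsilon-greedy agent on Bernoulli arms. This is an illustrative sketch, not a method from any of the papers below; the arm means, `epsilon`, and round count are arbitrary choices for the example.

```python
import random

def epsilon_greedy_bandit(arm_means, n_rounds=10000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on Bernoulli arms with the given means."""
    rng = random.Random(seed)
    n_arms = len(arm_means)
    counts = [0] * n_arms    # pulls per arm
    values = [0.0] * n_arms  # running empirical mean reward per arm
    total_reward = 0.0
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a random arm
        else:
            arm = max(range(n_arms), key=lambda a: values[a])  # exploit: best empirical arm
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean update
        total_reward += reward
    return counts, total_reward

counts, total = epsilon_greedy_bandit([0.2, 0.5, 0.8])
```

With a small `epsilon`, most pulls concentrate on the arm with the highest empirical mean, while occasional random pulls keep estimates of the other arms from going stale.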

(Image credit: Microsoft Research)

Papers

Showing 881–890 of 1262 papers

Title | Status | Hype
Contextual Bandits with Side-Observations | — | 0
Concurrent Decentralized Channel Allocation and Access Point Selection using Multi-Armed Bandits in multi BSS WLANs | — | 0
(Locally) Differentially Private Combinatorial Semi-Bandits | — | 0
Locally Differentially Private (Contextual) Bandits Learning | Code | 0
To update or not to update? Delayed Nonparametric Bandits with Randomized Allocation | — | 0
Greedy Algorithm almost Dominates in Smoothed Contextual Bandits | — | 0
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL | Code | 1
Neural Network Retraining for Model Serving | — | 0
Learning to Rank in the Position Based Model with Bandit Feedback | — | 0
Thompson Sampling for Linearly Constrained Bandits | Code | 0
Page 89 of 127

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | — | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | — | Unverified
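The metric in the table above, cumulative regret, is the gap between the reward an oracle that always plays the best arm would collect and the reward the algorithm actually collects. A minimal sketch of the expected-regret computation (the arm means and the always-play-arm-0 policy are illustrative, not taken from the benchmark):

```python
def cumulative_regret(arm_means, pulls):
    """Expected cumulative regret of a sequence of arm pulls.

    Each round's regret is the best arm's mean reward minus the
    mean reward of the arm actually pulled.
    """
    best = max(arm_means)
    return sum(best - arm_means[a] for a in pulls)

# A policy that always plays arm 0 on arms with means [0.2, 0.8]
# incurs 0.6 expected regret per round, so ~60 over 100 rounds.
regret = cumulative_regret([0.2, 0.8], [0] * 100)
```

Lower is better: an algorithm that quickly identifies and sticks to the best arm keeps the per-round regret near zero, so its cumulative regret grows slowly.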