
Multi-Armed Bandits

The multi-armed bandit problem is a task in which a fixed, limited amount of resources must be allocated among competing choices (arms) so as to maximize expected gain, when each choice's reward properties are only partially known at the time of allocation. These problems typically involve an exploration/exploitation trade-off.

(Image credit: Microsoft Research)
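
To make the exploration/exploitation trade-off concrete, here is a minimal epsilon-greedy sketch: with probability epsilon the agent explores a random arm, otherwise it exploits the arm with the best reward estimate so far. The Bernoulli arm probabilities and all function names are hypothetical illustrations, not taken from any paper listed on this page.

```python
import random

def run_epsilon_greedy(arm_probs, steps=10_000, epsilon=0.1, seed=0):
    """Simulate an epsilon-greedy agent on Bernoulli arms.

    arm_probs: hypothetical per-arm success probabilities,
    unknown to the agent and only revealed through sampled rewards.
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    pulls = [0] * n_arms      # how many times each arm was played
    means = [0.0] * n_arms    # running reward estimate per arm
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                     # explore
        else:
            arm = max(range(n_arms), key=lambda a: means[a])  # exploit
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        pulls[arm] += 1
        means[arm] += (reward - means[arm]) / pulls[arm]    # incremental mean
        total_reward += reward
    return total_reward, pulls

if __name__ == "__main__":
    reward, pulls = run_epsilon_greedy([0.2, 0.5, 0.7])
    print(f"total reward: {reward:.0f}, pulls per arm: {pulls}")
```

With these assumed probabilities, most pulls should concentrate on the third arm while the epsilon fraction of exploratory pulls keeps the estimates of the other arms from going stale.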

Papers

Showing 271–280 of 1,262 papers

| Title | Status | Hype |
| --- | --- | --- |
| Contextual Bandits for adapting to changing User preferences over time | | 0 |
| Contextual Bandits for Advertising Budget Allocation | | 0 |
| Contextual Bandits for Advertising Campaigns: A Diffusion-Model Independent Approach (Extended Version) | | 0 |
| Contextual Bandits for Evaluating and Improving Inventory Control Policies | | 0 |
| Contextual Bandits for Unbounded Context Distributions | | 0 |
| Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning | | 0 |
| Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning at Adyen | | 0 |
| Linear Bandits with Stochastic Delayed Feedback | | 0 |
| Contextual Bandits with Arm Request Costs and Delays | | 0 |
| A Federated Online Restless Bandit Framework for Cooperative Resource Allocation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified |
| 2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified |
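
For context, cumulative regret (the metric above; lower is better) measures the gap between the reward of always playing the best arm and the reward actually collected. The page does not define it, but a standard formulation is:

```latex
R_T = T\,\mu^{\ast} - \sum_{t=1}^{T} \mu_{a_t},
\qquad \mu^{\ast} = \max_i \mu_i
```

where $\mu_i$ is the expected reward of arm $i$ and $a_t$ is the arm chosen at step $t$.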