
Multi-Armed Bandits

The multi-armed bandit problem is a task in which a fixed amount of resources must be allocated among competing choices (the "arms") so as to maximize expected gain. Because the payoff of each arm is only partially known and is learned by trying it, these problems typically involve an exploration/exploitation trade-off.
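The exploration/exploitation trade-off can be illustrated with an epsilon-greedy strategy, one simple approach among many. The sketch below is a minimal illustration, not taken from any paper listed on this page. The Bernoulli reward model and the names `epsilon_greedy`, `true_means`, `steps`, and `epsilon` are assumptions made for the example.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Simulate epsilon-greedy on Bernoulli arms (hypothetical setup).

    Returns (pull counts per arm, total reward collected).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated mean
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Assumed reward model: 1 with probability true_means[arm], else 0.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for the pulled arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, total_reward


if __name__ == "__main__":
    counts, total = epsilon_greedy([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("total reward:", total)
```

A larger `epsilon` spends more pulls on exploration and a smaller one commits sooner to the current best estimate. Cumulative regret, the metric used in the benchmark table on this page, would be the gap between the reward of always pulling the best arm and the reward actually collected.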

(Image credit: Microsoft Research)

Papers


Showing 401–410 of 1262 papers

Title	Date	Tasks	Status
Designing an Interpretable Interface for Contextual Bandits	Sep 23, 2024	Multi-Armed Bandits, Off-policy evaluation	Unverified
Dynamic Global Sensitivity for Differentially Private Contextual Bandits	Aug 30, 2022	Interactive Recommendation, Multi-Armed Bandits	Unverified
Dynamic pricing and assortment under a contextual MNL demand	Oct 19, 2021	Multi-Armed Bandits	Unverified
Dynamic Pricing with Limited Supply	Aug 20, 2011	Multi-Armed Bandits	Unverified
Dynamic Product Image Generation and Recommendation at Scale for Personalized E-commerce	Aug 22, 2024	Image Generation, Multi-Armed Bandits	Unverified
Early Stopping in Contextual Bandits and Inferences	Feb 5, 2025	Decision Making, Multi-Armed Bandits	Unverified
Ease.ml: Towards Multi-tenant Resource Sharing for Machine Learning Workloads	Aug 24, 2017	Bayesian Optimization, BIG-bench Machine Learning	Unverified
EduQate: Generating Adaptive Curricula through RMABs in Education Settings	Jun 20, 2024	Multi-Armed Bandits, Q-Learning	Unverified
BEACON: Balancing Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes	Jun 19, 2024	Multi-Armed Bandits, Nutrition	Unverified
Delegating via Quitting Games	Apr 20, 2018	Multi-Armed Bandits	Unverified


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	NeuralLinear FullPosterior-MR	Cumulative regret	1.92	—	Unverified
2	Linear FullPosterior-MR	Cumulative regret	1.82	—	Unverified