SOTAVerified

Multi-Armed Bandits

The multi-armed bandit problem is a task in which a fixed amount of resources must be allocated among competing alternatives so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off: the learner must balance sampling arms to learn their payoffs against repeatedly pulling the arm currently believed to be best.
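The trade-off above can be illustrated with a minimal epsilon-greedy sketch (an assumption of this example, not an algorithm from the page): with probability epsilon the agent explores a random arm, otherwise it exploits the arm with the highest empirical mean. All names below are illustrative.

```python
import random

def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=5000, seed=0):
    """Simulate an epsilon-greedy agent on a Bernoulli bandit.

    true_means: per-arm reward probabilities (unknown to the agent).
    Returns the per-arm empirical value estimates and pull counts.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms    # pulls per arm
    values = [0.0] * n_arms  # running mean reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:   # explore: pick a random arm
            arm = rng.randrange(n_arms)
        else:                        # exploit: pick the best estimate so far
            arm = max(range(n_arms), key=lambda a: values[a])
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean

    return values, counts

values, counts = epsilon_greedy_bandit([0.2, 0.5, 0.8])
```

After enough steps, the arm with the highest true mean (here 0.8) accumulates the most pulls, while the exploration rate epsilon keeps a small stream of data flowing to the other arms.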

(Image credit: Microsoft Research)

Papers

Showing 421–430 of 1262 papers

Title | Status | Hype
Improving Fairness in Adaptive Social Exergames via Shapley Bandits | | 0
Practical Contextual Bandits with Feedback Graphs | | 0
Infinite Action Contextual Bandits with Reusable Data Exhaust | Code | 0
Bandit Social Learning: Exploration under Myopic Behavior | | 0
Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation | | 0
Adversarial Rewards in Universal Learning for Contextual Bandits | | 0
Piecewise-Stationary Multi-Objective Multi-Armed Bandit with Application to Joint Communications and Sensing | Code | 0
Leveraging User-Triggered Supervision in Contextual Bandits | | 0
On Private and Robust Bandits | | 0
Multiplier Bootstrap-based Exploration | | 0
Page 43 of 127

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified
2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified
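The benchmark metric above, cumulative regret, measures how much expected reward a policy forfeits relative to always pulling the best arm. A minimal sketch of that computation, assuming known Bernoulli arm means and a recorded pull sequence (the function name and inputs are illustrative, not the benchmark's actual harness):

```python
def cumulative_regret(true_means, pulls):
    """Expected cumulative regret of a pull sequence: the sum, over steps,
    of the gap between the best arm's mean and the chosen arm's mean."""
    best = max(true_means)
    return sum(best - true_means[arm] for arm in pulls)

# e.g. with arms of mean 0.2 and 0.8, pulling [0, 1, 1]
# incurs regret only on the first (suboptimal) pull.
regret = cumulative_regret([0.2, 0.8], [0, 1, 1])
```

Lower values are better; an algorithm whose cumulative regret grows sublinearly in the number of pulls is learning to concentrate on the best arm.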