
Multi-Armed Bandits

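A multi-armed bandit is a problem in which a fixed, limited set of resources must be allocated among competing choices (arms) so as to maximize expected gain. These problems typically involve an exploration/exploitation trade-off: trying uncertain arms to learn more about them versus pulling the arm that currently looks best.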
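The exploration/exploitation trade-off can be illustrated with a minimal sketch of epsilon-greedy, one common baseline strategy (it is not taken from any paper listed here). The function name `epsilon_greedy`, the Bernoulli reward model, and the parameters `true_means`, `steps`, `epsilon`, and `seed` are assumptions made for illustration only.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on a simulated Bernoulli bandit.

    true_means is a hypothetical list of per-arm success probabilities;
    a real learner would not know these and only sees sampled rewards.
    Returns (estimated means, pull counts, total reward).
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms          # number of pulls per arm
    estimates = [0.0] * n_arms     # running mean reward per arm
    total_reward = 0.0

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated mean.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Simulated Bernoulli reward for the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return estimates, counts, total_reward


estimates, counts, total_reward = epsilon_greedy([0.2, 0.5, 0.8])
print("pulls per arm:", counts)
print("estimated means:", [round(e, 2) for e in estimates])
```

With a small `epsilon`, most pulls go to the arm with the best estimate, while the occasional random pull keeps the estimates of the other arms from going stale. A larger `epsilon` learns about all arms faster but spends more pulls on arms that are known to be worse.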


Papers

Showing 821-830 of 1262 papers

| Title | Status | Hype |
| Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities | | 0 |
| Multi-Agent Multi-Armed Bandits with Limited Communication | | 0 |
| Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics | | 0 |
| Multi-Agent Stochastic Bandits Robust to Adversarial Corruptions | | 0 |
| Multi-armed Bandit Learning for TDMA Transmission Slot Scheduling and Defragmentation for Improved Bandwidth Usage | | 0 |
| Multi-Armed Bandits and Quantum Channel Oracles | | 0 |
| Multi-armed Bandits: Competing with Optimal Sequences | | 0 |
| Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback | | 0 |
| Multi-Armed Bandits for Intelligent Tutoring Systems | | 0 |
| Multi-armed Bandits for Link Configuration in Millimeter-wave Networks | | 0 |
Page 83 of 127

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| 1 | NeuralLinear FullPosterior-MR | Cumulative regret | 1.92 | | Unverified |
| 2 | Linear FullPosterior-MR | Cumulative regret | 1.82 | | Unverified |