Fighting Contextual Bandits with Stochastic Smoothing Oct 11, 2018 Multi-Armed Bandits
Finding All ε-Good Arms in Stochastic Bandits Dec 1, 2020 All Multi-Armed Bandits
Finding the bandit in a graph: Sequential search-and-stop Jun 6, 2018 Multi-Armed Bandits
Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap Feb 9, 2021 Multi-Armed Bandits
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation Jan 10, 2025 Multi-Armed Bandits
Finite-Time Analysis of Kernelised Contextual Bandits Sep 26, 2013 Multi-Armed Bandits
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation Oct 3, 2023 Multi-Armed Bandits Q-Learning
Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
First- and Second-Order Bounds for Adversarial Linear Contextual Bandits May 1, 2023 Multi-Armed Bandits
Fixed-Budget Best-Arm Identification in Structured Bandits Jun 9, 2021 Multi-Armed Bandits
FLASH: Federated Learning Across Simultaneous Heterogeneities Feb 13, 2024 Federated Learning Multi-Armed Bandits
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Mar 30, 2022 Decision Making Heterogeneous Treatment Effect Estimation
Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts Sep 25, 2023 LEMMA Multi-Armed Bandits
Foundations of Reinforcement Learning and Interactive Decision Making Dec 27, 2023 Decision Making Multi-Armed Bandits
From Bandits to Experts: A Tale of Domination and Independence Jul 17, 2013 Multi-Armed Bandits
From Bandits to Experts: On the Value of Side-Observations Dec 1, 2011 Multi-Armed Bandits
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses May 16, 2022 Multi-Armed Bandits
Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information Jan 12, 2021 Multi-Armed Bandits
Full Gradient Deep Reinforcement Learning for Average-Reward Criterion Apr 7, 2023 Deep Reinforcement Learning Multi-Armed Bandits
Fully Gap-Dependent Bounds for Multinomial Logit Bandit Nov 19, 2020 Multi-Armed Bandits
Fundamental Limits of Online and Distributed Algorithms for Statistical Learning and Estimation Nov 14, 2013 Multi-Armed Bandits Stochastic Optimization
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits Nov 13, 2018 Multi-Armed Bandits
Gaussian Process bandits with adaptive discretization Dec 5, 2017 Multi-Armed Bandits
Generalized Policy Elimination: an efficient algorithm for Nonparametric Contextual Bandits Mar 5, 2020 Multi-Armed Bandits
Generalized Risk-Aversion in Stochastic Multi-Armed Bandits May 5, 2014 Multi-Armed Bandits
Generalized Thompson Sampling for Contextual Bandits Oct 27, 2013 Multi-Armed Bandits Thompson Sampling
Generalized Translation and Scale Invariant Online Algorithm for Adversarial Multi-Armed Bandits Sep 19, 2021 Multi-Armed Bandits Translation
Generalizing distribution of partial rewards for multi-armed bandits with temporally-partitioned rewards Nov 13, 2022 Multi-Armed Bandits
Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation Feb 15, 2023 Multi-Armed Bandits Stochastic Optimization
GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits Aug 19, 2024 Multi-Armed Bandits Q-Learning
Global Bandits Mar 29, 2015 Decision Making Informativeness
Global Rewards in Restless Multi-Armed Bandits Jun 2, 2024 Multi-Armed Bandits
Gradient-free Online Learning in Continuous Games with Delayed Rewards Jan 1, 2020 Multi-Armed Bandits Recommendation Systems
Graph Clustering Bandits for Recommendation May 2, 2016 Clustering Graph Clustering
Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference Mar 10, 2025 Multi-Armed Bandits Sequential Decision Making
Practical Contextual Bandits with Feedback Graphs Feb 17, 2023 Multi-Armed Bandits regression
Graph Neural Bandits Aug 21, 2023 Multi-Armed Bandits
Greedy Algorithm almost Dominates in Smoothed Contextual Bandits May 19, 2020 Diversity Multi-Armed Bandits
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Mar 6, 2025 Decision Making Multi-Armed Bandits
Greedy Bandits with Sampled Context Jul 27, 2020 Decision Making Multi-Armed Bandits
Greybox fuzzing as a contextual bandits problem Jun 11, 2018 Multi-Armed Bandits
Guaranteed Fixed-Confidence Best Arm Identification in Multi-Armed Bandits: Simple Sequential Elimination Algorithms Jun 12, 2021 Multi-Armed Bandits
GuideBoot: Guided Bootstrap for Deep Contextual Bandits Jul 18, 2021 Multi-Armed Bandits Thompson Sampling
Hawkes Process Multi-armed Bandits for Disaster Search and Rescue Apr 3, 2020 Multi-Armed Bandits
HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems Jan 28, 2025 Computational Efficiency Multi-Armed Bandits
Heterogeneous Multi-Agent Bandits with Parsimonious Hints Feb 22, 2025 4k Multi-Armed Bandits
Heterogeneous Multi-agent Multi-armed Bandits on Stochastic Block Models Feb 11, 2025 Multi-Armed Bandits Stochastic Block Model
Heterogeneous Multi-Player Multi-Armed Bandits Robust To Adversarial Attacks Jan 21, 2025 Adversarial Attack All
Hierarchical Optimistic Region Selection driven by Curiosity Dec 1, 2012 Active Learning Multi-Armed Bandits
High-dimensional Linear Bandits with Knapsacks Nov 2, 2023 Multi-Armed Bandits