Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs May 18, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Small-loss bounds for online learning with partial information Nov 9, 2017 Multi-Armed Bandits
— Unverified 00 Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness May 25, 2023 Fairness Multi-Armed Bandits
— Unverified 00 SmartChoices: Augmenting Software with Learned Implementations Apr 12, 2023 Multi-Armed Bandits Philosophy
— Unverified 00 Smoothed Online Learning is as Easy as Statistical Learning Feb 9, 2022 Learning Theory Multi-Armed Bandits
— Unverified 00 Smooth Sequential Optimisation with Delayed Feedback Jun 21, 2021 Multi-Armed Bandits
— Unverified 00 Social Learning in Multi Agent Multi Armed Bandits Oct 4, 2019 Multi-Armed Bandits
— Unverified 00 Sparse Additive Contextual Bandits: A Nonparametric Approach for Online Decision-making with High-dimensional Covariates Mar 21, 2025 Decision Making Multi-Armed Bandits
— Unverified 00 Sparse Nonparametric Contextual Bandits Mar 20, 2025 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Sparsity, variance and curvature in multi-armed bandits Nov 3, 2017 Generalization Bounds Learning Theory
— Unverified 00 SPRT-based Efficient Best Arm Identification in Stochastic Bandits Jul 22, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Squeeze All: Novel Estimator and Self-Normalized Bound for Linear Contextual Bandits Jun 11, 2022 All Multi-Armed Bandits
— Unverified 00 Stability Enforced Bandit Algorithms for Channel Selection in Remote State Estimation of Gauss-Markov Processes May 20, 2022 channel selection Multi-Armed Bandits
— Unverified 00 Stabilizing the Kumaraswamy Distribution Oct 1, 2024 Link Prediction Multi-Armed Bandits
— Unverified 00 Stateful Offline Contextual Policy Evaluation and Learning Oct 19, 2021 Management Multi-Armed Bandits
— Unverified 00 Statistical Inference with M-Estimators on Adaptively Collected Data Apr 29, 2021 Decision Making Multi-Armed Bandits
— Unverified 00 Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits Aug 28, 2020 Multi-Armed Bandits
— Unverified 00 Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits Feb 21, 2024 Multi-Armed Bandits
— Unverified 00 Stochastic Approximation Approaches to Group Distributionally Robust Optimization and Beyond Feb 18, 2023 Multi-Armed Bandits
— Unverified 00 Concentration bounds for temporal difference learning with linear function approximation: The case of batch data and uniform sampling Jun 11, 2013 Multi-Armed Bandits News Recommendation
— Unverified 00 Stochastic Bandits for Egalitarian Assignment Oct 8, 2024 Fairness Multi-Armed Bandits
— Unverified 00 Stochastic Bandits with Linear Constraints Jun 17, 2020 Multi-Armed Bandits
— Unverified 00 Stochastic Bandits with Vector Losses: Minimizing ^-Norm of Relative Losses Oct 15, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Stochastic Contextual Bandits with Graph-based Contexts May 2, 2023 Multi-Armed Bandits
— Unverified 00 Stochastic contextual bandits with graph feedback: from independence number to MAS number Feb 12, 2024 Multi-Armed Bandits
— Unverified 00 Stochastic Contextual Bandits with Known Reward Functions Apr 30, 2016 Decision Making Multi-Armed Bandits
— Unverified 00 Stochastic Contextual Bandits with Long Horizon Rewards Feb 2, 2023 Decision Making Language Modeling
— Unverified 00 Stochastic differential equations for limiting description of UCB rule for Gaussian multi-armed bandits Dec 13, 2021 Multi-Armed Bandits
— Unverified 00 Stochastic Graph Bandit Learning with Side-Observations Aug 29, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 Stochastic Linear Contextual Bandits with Diverse Contexts Mar 5, 2020 Diversity Multi-Armed Bandits
— Unverified 00 Stochastic Multi-armed Bandits in Constant Space Dec 25, 2017 Multi-Armed Bandits
— Unverified 00 Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions Jun 4, 2021 Multi-Armed Bandits
— Unverified 00 Achieving Fairness in Stochastic Multi-armed Bandit Problem May 27, 2019 Fairness Multi-Armed Bandits
— Unverified 00 Stochastic Multi-Armed Bandits with Control Variates May 9, 2021 Multi-Armed Bandits
— Unverified 00 Stochastic Multi-armed Bandits with Non-stationary Rewards Generated by a Linear Dynamical System Apr 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 00 Stochastic Multi-Objective Multi-Armed Bandits: Regret Definition and Algorithm Jun 16, 2025 Multi-Armed Bandits
— Unverified 00 Stochastic Network Utility Maximization with Unknown Utilities: Multi-Armed Bandits Approach Jun 17, 2020 Multi-Armed Bandits
— Unverified 00 Stochastic Neural Network with Kronecker Flow Jun 10, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Strategic Linear Contextual Bandits Jun 1, 2024 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk Apr 1, 2022 Multi-Armed Bandits
— Unverified 00 Streaming Algorithms for Stochastic Multi-armed Bandits Dec 9, 2020 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 00 Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis Feb 26, 2020 Multi-Armed Bandits
— Unverified 00 Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks Apr 25, 2024 Fairness Multi-Armed Bandits
— Unverified 00 Structure Matters: Dynamic Policy Gradient Nov 7, 2024 Multi-Armed Bandits
— Unverified 00 Sublinear Optimal Policy Value Estimation in Contextual Bandits Dec 12, 2019 Multi-Armed Bandits
— Unverified 00 Surrogate Objectives for Batch Policy Optimization in One-step Decision Making Dec 1, 2019 Decision Making Multi-Armed Bandits
— Unverified 00 Survey Bandits with Regret Guarantees Feb 23, 2020 Multi-Armed Bandits Survey
— Unverified 00 Taking a hint: How to leverage loss predictors in contextual bandits? Mar 4, 2020 Multi-Armed Bandits
— Unverified 00 Target Tracking for Contextual Bandits: Application to Demand Side Management Jan 28, 2019 Management Multi-Armed Bandits
— Unverified 00 Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits Sep 18, 2023 Dialogue Act Classification Multi-Armed Bandits
— Unverified 00