Queue Scheduling with Adversarial Bandit Learning Mar 3, 2023 Multi-Armed Bandits Scheduling
Quick-Draw Bandits: Quickly Optimizing in Nonstationary Environments with Extremely Many Arms May 30, 2025 Multi-Armed Bandits
Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits Jul 28, 2022 Model-based Reinforcement Learning Multi-Armed Bandits
Random Effect Bandits Jun 23, 2021 Multi-Armed Bandits Thompson Sampling
Randomized Allocation with Nonparametric Estimation for Contextual Multi-Armed Bandits with Delayed Rewards Feb 3, 2019 Multi-Armed Bandits
Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback Feb 2, 2023 Multi-Armed Bandits
Towards Fundamental Limits of Multi-armed Bandits with Random Walk Feedback Nov 3, 2020 Multi-Armed Bandits Recommendation Systems
Rarely-switching linear bandits: optimization of causal effects for the real world May 30, 2019 Causal Inference Multi-Armed Bandits
Rate-Constrained Remote Contextual Bandits Apr 26, 2022 Marketing Multi-Armed Bandits
Reciprocal Learning Aug 12, 2024 Active Learning Multi-Armed Bandits
Recommenadation aided Caching using Combinatorial Multi-armed Bandits Apr 30, 2024 Multi-Armed Bandits
Recurrent Submodular Welfare and Matroid Blocking Bandits Jan 30, 2021 Blocking Multi-Armed Bandits
Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits May 21, 2021 Blocking Multi-Armed Bandits
Reducing Dueling Bandits to Cardinal Bandits May 14, 2014 Multi-Armed Bandits
Regional Multi-Armed Bandits Feb 22, 2018 Multi-Armed Bandits
Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits Feb 10, 2021 Multi-Armed Bandits regression
Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function Aug 4, 2021 Multi-Armed Bandits
Regret Analysis of the Finite-Horizon Gittins Index Strategy for Multi-Armed Bandits Nov 18, 2015 Multi-Armed Bandits Thompson Sampling
Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms Sep 20, 2020 Multi-Armed Bandits reinforcement-learning
Stochastic Top-K Subset Bandits with Linear Space and Non-Linear Feedback Nov 29, 2018 Multi-Armed Bandits
Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory Jan 24, 2019 Multi-Armed Bandits
Regret vs. Communication: Distributed Stochastic Multi-Armed Bandits and Beyond Apr 14, 2015 Multi-Armed Bandits
Regularized Contextual Bandits Oct 11, 2018 Multi-Armed Bandits
Regularized-OFU: an efficient algorithm for general contextual bandit with optimization oracles Sep 29, 2021 Multi-Armed Bandits Thompson Sampling
Regularized OFU: an Efficient UCB Estimator for Non-linear Contextual Bandit Jun 29, 2021 Multi-Armed Bandits
Reinforced Meta Active Learning Mar 9, 2022 Active Learning Informativeness
Reinforcement Learning for Machine Learning Model Deployment: Evaluating Multi-Armed Bandits in ML Ops Environments Mar 28, 2025 Management Model Selection
Reinforcement learning techniques for Outer Loop Link Adaptation in 4G/5G systems Aug 3, 2017 Multi-Armed Bandits reinforcement-learning
Multi-Armed Bandits with Fairness Constraints for Distributing Resources to Human Teammates Jun 30, 2019 Fairness Multi-Armed Bandits
Reliability-Optimized User Admission Control for URLLC Traffic: A Neural Contextual Bandit Approach Jan 5, 2024 Multi-Armed Bandits
Remote Contextual Bandits Feb 10, 2022 Marketing Multi-Armed Bandits
Replicability is Asymptotically Free in Multi-armed Bandits Feb 12, 2024 Decision Making Multi-Armed Bandits
Representation-Driven Reinforcement Learning May 31, 2023 Multi-Armed Bandits reinforcement-learning
Representative Arm Identification: A fixed confidence approach to identify cluster representatives Aug 26, 2024 Multi-Armed Bandits
Replicable Bandits Oct 4, 2022 Multi-Armed Bandits
Residual Bootstrap Exploration for Bandit Algorithms Feb 19, 2020 Computational Efficiency Multi-Armed Bandits
Resonance: Replacing Software Constants with Context-Aware Models in Real-time Communication Nov 23, 2020 Friction Multi-Armed Bandits
Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism Oct 31, 2020 Distributed Computing Multi-Armed Bandits
Resource Allocation in NOMA-based Self-Organizing Networks using Stochastic Multi-Armed Bandits Jan 16, 2021 Management Multi-Armed Bandits
Resourceful Contextual Bandits Feb 27, 2014 Multi-Armed Bandits
Restless Multi-Armed Bandits under Exogenous Global Markov Process Feb 28, 2022 Multi-Armed Bandits
Restless Multi-armed Bandits under Frequency and Window Constraints for Public Service Inspections Jan 27, 2025 Multi-Armed Bandits Scheduling
Revisiting Simple Regret: Fast Rates for Returning a Good Arm Oct 30, 2022 Multi-Armed Bandits
Reward Biased Maximum Likelihood Estimation for Reinforcement Learning Nov 16, 2020 Multi-Armed Bandits reinforcement-learning
Reward Maximization for Pure Exploration: Minimax Optimal Good Arm Identification for Nonparametric Multi-Armed Bandits Oct 21, 2024 Multi-Armed Bandits valid
Reward Teaching for Federated Multi-armed Bandits May 3, 2023 Multi-Armed Bandits
Rising Rested Bandits: Lower Bounds and Efficient Algorithms Nov 6, 2024 Model Selection Multi-Armed Bandits
Risk-Averse Multi-Armed Bandits with Unobserved Confounders: A Case Study in Emotion Regulation in Mobile Health Sep 9, 2022 Multi-Armed Bandits Transfer Learning
Risk averse non-stationary multi-armed bandits Sep 28, 2021 Multi-Armed Bandits
Risk-Aversion in Multi-armed Bandits Dec 1, 2012 Multi-Armed Bandits