An Optimal Algorithm for Multiplayer Multi-Armed Bandits Sep 28, 2019 Multi-Armed Bandits
— Unverified 00 An optimal learning method for developing personalized treatment regimes Jul 6, 2016 Clustering Multi-Armed Bandits
— Unverified 00 An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints Dec 11, 2024 Multi-Armed Bandits
— Unverified 00 A General Reduction for High-Probability Analysis with General Light-Tailed Distributions Mar 5, 2024 Multi-Armed Bandits Stochastic Optimization
— Unverified 00 A Novel Approach to Balance Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes and its Implementation in BEACON Dec 23, 2024 Multi-Armed Bandits Nutrition
— Unverified 00 A One-Size-Fits-All Solution to Conservative Bandit Problems Dec 14, 2020 All Multi-Armed Bandits
— Unverified 00 Approximate Function Evaluation via Multi-Armed Bandits Mar 18, 2022 Multi-Armed Bandits
— Unverified 00 Approximately Stationary Bandits with Knapsacks Feb 28, 2023 Multi-Armed Bandits
— Unverified 00 A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 00 A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints Dec 24, 2020 Decision Making Fairness
— Unverified 00 A Reinforcement-Learning-Enhanced LLM Framework for Automated A/B Testing in Personalized Marketing May 27, 2025 Marketing Multi-Armed Bandits
— Unverified 00 A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits Oct 24, 2023 Change Point Detection Multi-Armed Bandits
— Unverified 00 A Simple and Optimal Policy Design with Safety against Heavy-Tailed Risk for Stochastic Bandits Jun 7, 2022 Multi-Armed Bandits Thompson Sampling
— Unverified 00 A Sleeping, Recovering Bandit Algorithm for Optimizing Recurring Notifications Aug 23, 2020 Multi-Armed Bandits
— Unverified 00 A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity Jul 28, 2017 Multi-Armed Bandits Reinforcement Learning
— Unverified 00 A Survey of Risk-Aware Multi-Armed Bandits May 12, 2022 Multi-Armed Bandits Portfolio Optimization
— Unverified 00 Asymptotically Best Causal Effect Identification with Multi-Armed Bandits Dec 1, 2021 Multi-Armed Bandits
— Unverified 00 Asymptotically Optimal Regret for Black-Box Predict-then-Optimize Jun 12, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments Feb 23, 2023 Multi-Armed Bandits regression
— Unverified 00 Asymptotic Convergence of Thompson Sampling Nov 8, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Asymptotic Instance-Optimal Algorithms for Interactive Decision Making Jun 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 00 Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Asymptotic Randomised Control with applications to bandits Oct 14, 2020 ARC Multi-Armed Bandits
— Unverified 00 Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis May 19, 2025 All Multi-Armed Bandits
— Unverified 00 A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning Jun 22, 2021 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Automatic Ensemble Learning for Online Influence Maximization Nov 25, 2019 Ensemble Learning Multi-Armed Bandits
— Unverified 00 AutoML for Contextual Bandits Sep 7, 2019 AutoML Feature Engineering
— Unverified 00 Autonomous Drug Design with Multi-Armed Bandits Jul 4, 2022 Drug Design Multi-Armed Bandits
— Unverified 00 Balanced Linear Contextual Bandits Dec 15, 2018 Causal Inference Multi-Armed Bandits
— Unverified 00 Balanced off-policy evaluation in general action spaces Jun 9, 2019 Binary Classification counterfactual
— Unverified 00 Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Aug 22, 2024 Language Modeling Language Modelling
— Unverified 00 Ballooning Multi-Armed Bandits Jan 24, 2020 Multi-Armed Bandits
— Unverified 00 Bandit Algorithms for Prophet Inequality and Pandora's Box Nov 16, 2022 Multi-Armed Bandits Stochastic Optimization
— Unverified 00 Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits Jul 2, 2019 Multi-Armed Bandits
— Unverified 00 BanditMF: Multi-Armed Bandit Based Matrix Factorization Recommender System Jun 21, 2021 Collaborative Filtering Multi-Armed Bandits
— Unverified 00 BanditQ: Fair Bandits with Guaranteed Rewards Apr 11, 2023 Multi-Armed Bandits
— Unverified 00 BanditRank: Learning to Rank Using Contextual Bandits Oct 23, 2019 Information Retrieval Learning-To-Rank
— Unverified 00 Bandit Regret Scaling with the Effective Loss Range May 15, 2017 Multi-Armed Bandits
— Unverified 00 Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Oct 13, 2021 Machine Translation Multi-Armed Bandits
— Unverified 00 Bandits Don’t Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Nov 1, 2021 Machine Translation Multi-Armed Bandits
— Unverified 00 Bandits for Learning to Explain from Explanations Feb 7, 2021 Gaussian Processes Multi-Armed Bandits
— Unverified 00 Bandits meet Computer Architecture: Designing a Smartly-allocated Cache Jan 31, 2016 Multi-Armed Bandits
— Unverified 00 Bandit Social Learning: Exploration under Myopic Behavior Feb 15, 2023 Multi-Armed Bandits
— Unverified 00 Bandits Warm-up Cold Recommender Systems Jul 10, 2014 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms Jul 21, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Bandits with Knapsacks beyond the Worst Case Dec 1, 2021 Multi-Armed Bandits
— Unverified 00 Bandits with Partially Observable Confounded Data Jun 11, 2020 Multi-Armed Bandits
— Unverified 00 Bandits with Temporal Stochastic Constraints Nov 22, 2018 Multi-Armed Bandits
— Unverified 00 Banker Online Mirror Descent Jun 16, 2021 Multi-Armed Bandits
— Unverified 00