High-dimensional Nonparametric Contextual Bandit Problem May 20, 2025 Decision Making Multi-Armed Bandits
— Unverified 0High Probability Bound for Cross-Learning Contextual Bandits with Unknown Context Distributions Oct 5, 2024 Multi-Armed Bandits
— Unverified 0Encrypted Linear Contextual Bandit Mar 17, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Honor Among Bandits: No-Regret Learning for Online Fair Division Jul 1, 2024 Fairness Multi-Armed Bandits
— Unverified 0Horde of Bandits using Gaussian Markov Random Fields Mar 7, 2017 Clustering Multi-Armed Bandits
— Unverified 0How Does Variance Shape the Regret in Contextual Bandits? Oct 16, 2024 Multi-Armed Bandits
— Unverified 0Human-AI Learning Performance in Multi-Armed Bandits Dec 21, 2018 Decision Making Multi-Armed Bandits
— Unverified 0Hypothesis Transfer in Bandits by Weighted Models Nov 14, 2022 Multi-Armed Bandits Transfer Learning
— Unverified 0Identifiable latent bandits: Combining observational data and exploration for personalized healthcare Jul 23, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Imitation-Regularized Offline Learning Jan 15, 2019 counterfactual Multi-Armed Bandits
— Unverified 0Imprecise Multi-Armed Bandits May 9, 2024 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Adversarial Bandits with Unbounded Losses Oct 3, 2023 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Misspecified Linear Markov Decision Processes Sep 12, 2021 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback Jan 31, 2023 Management Multi-Armed Bandits
— Unverified 0Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms Feb 27, 2023 Multi-Armed Bandits
— Unverified 0Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs Oct 4, 2022 Multi-Armed Bandits
— Unverified 0Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing Feb 15, 2025 Multi-Armed Bandits
— Unverified 0A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit Nov 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards Jun 5, 2025 Experimental Design Multi-Armed Bandits
— Unverified 0Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits Jun 1, 2016 Multi-Armed Bandits
— Unverified 0Improving Fairness in Adaptive Social Exergames via Shapley Bandits Feb 18, 2023 Fairness Multi-Armed Bandits
— Unverified 0Improving Offline Contextual Bandits with Distributional Robustness Nov 13, 2020 counterfactual Multi-Armed Bandits
— Unverified 0Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions Jun 16, 2024 Multi-Armed Bandits Policy Gradient Methods
— Unverified 0Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits Aug 28, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Incentivising Exploration and Recommendations for Contextual Bandits with Payments Jan 22, 2020 Multi-Armed Bandits
— Unverified 0Incentivized Exploration for Multi-Armed Bandits under Reward Drift Nov 12, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Incentivized Exploration via Filtered Posterior Sampling Feb 20, 2024 Multi-Armed Bandits
— Unverified 0Increasing Students' Engagement to Reminder Emails Through Multi-Armed Bandits Aug 10, 2022 Management Multi-Armed Bandits
— Unverified 0Indexability and Rollout Policy for Multi-State Partially Observable Restless Bandits Jul 30, 2021 Multi-Armed Bandits Recommendation Systems
— Unverified 0Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits May 24, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Individual Regret in Cooperative Nonstochastic Multi-Armed Bandits Jul 7, 2019 Multi-Armed Bandits
— Unverified 0Individual Regret in Cooperative Stochastic Multi-Armed Bandits Nov 10, 2024 Multi-Armed Bandits
— Unverified 0In-Domain African Languages Translation Using LLMs and Multi-armed Bandits May 21, 2025 Domain Adaptation Machine Translation
— Unverified 0Inference for Batched Bandits Feb 8, 2020 Multi-Armed Bandits
— Unverified 0Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective Oct 7, 2020 Active Learning Multi-Armed Bandits
— Unverified 0Instance-optimal PAC Algorithms for Contextual Bandits Jul 5, 2022 Multi-Armed Bandits
— Unverified 0Concentrated Differential Privacy for Bandits Sep 1, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 0Investigating Gender Fairness in Machine Learning-driven Personalized Care for Chronic Pain Feb 29, 2024 Decision Making Fairness
— Unverified 0Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement Feb 24, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Is Prior-Free Black-Box Non-Stationary Reinforcement Learning Feasible? Oct 17, 2024 Change Detection Multi-Armed Bandits
— Unverified 0Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon Sep 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Joint Representation Training in Sequential Tasks with Shared Structure Jun 24, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Kernel-based Multi-Task Contextual Bandits in Cellular Network Configuration Nov 27, 2018 Multi-Armed Bandits Multi-Task Learning
— Unverified 0Kernel ε-Greedy for Multi-Armed Bandits with Covariates Jun 29, 2023 Multi-Armed Bandits
— Unverified 0Kernel Methods for Cooperative Multi-Agent Contextual Bandits Aug 14, 2020 Decision Making Multi-Armed Bandits
— Unverified 0KL-regularization Itself is Differentially Private in Bandits and RLHF May 23, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits Jun 25, 2021 Descriptive Multi-Armed Bandits
— Unverified 0Lagrangian Index Policy for Restless Bandits with Average Reward Dec 17, 2024 Multi-Armed Bandits reinforcement-learning
— Unverified 0Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning Jun 15, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Latent Contextual Bandits and their Application to Personalized Recommendations for New Users Apr 22, 2016 Multi-Armed Bandits
— Unverified 0