Contextual Bandits for Unbounded Context Distributions Aug 19, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Heterogeneous Multi-Player Multi-Armed Bandits Robust To Adversarial Attacks Jan 21, 2025 Adversarial Attack All
— Unverified 0Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning Nov 22, 2022 Multi-Armed Bandits
— Unverified 0Full Gradient Deep Reinforcement Learning for Average-Reward Criterion Apr 7, 2023 Deep Reinforcement Learning Multi-Armed Bandits
— Unverified 0Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning at Adyen Nov 30, 2024 Multi-Armed Bandits regression
— Unverified 0Hierarchical Optimistic Region Selection driven by Curiosity Dec 1, 2012 Active Learning Multi-Armed Bandits
— Unverified 0High-dimensional Linear Bandits with Knapsacks Nov 2, 2023 Multi-Armed Bandits
— Unverified 0High-dimensional Nonparametric Contextual Bandit Problem May 20, 2025 Decision Making Multi-Armed Bandits
— Unverified 0High Probability Bound for Cross-Learning Contextual Bandits with Unknown Context Distributions Oct 5, 2024 Multi-Armed Bandits
— Unverified 0Encrypted Linear Contextual Bandit Mar 17, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Honor Among Bandits: No-Regret Learning for Online Fair Division Jul 1, 2024 Fairness Multi-Armed Bandits
— Unverified 0Horde of Bandits using Gaussian Markov Random Fields Mar 7, 2017 Clustering Multi-Armed Bandits
— Unverified 0How Does Variance Shape the Regret in Contextual Bandits? Oct 16, 2024 Multi-Armed Bandits
— Unverified 0Human-AI Learning Performance in Multi-Armed Bandits Dec 21, 2018 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Bandits with Continuous Actions: Smoothing, Zooming, and Adapting Feb 5, 2019 Multi-Armed Bandits
— Unverified 0Adapting to Misspecification in Contextual Bandits with Offline Regression Oracles Feb 26, 2021 Multi-Armed Bandits regression
— Unverified 0Instance-Dependent Complexity of Contextual Bandits and Reinforcement Learning: A Disagreement-Based Perspective Oct 7, 2020 Active Learning Multi-Armed Bandits
— Unverified 0Identifiable latent bandits: Combining observational data and exploration for personalized healthcare Jul 23, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Aug 22, 2024 Language Modeling Language Modelling
— Unverified 0Imitation-Regularized Offline Learning Jan 15, 2019 counterfactual Multi-Armed Bandits
— Unverified 0The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information Jan 12, 2021 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Adversarial Bandits with Unbounded Losses Oct 3, 2023 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Misspecified Linear Markov Decision Processes Sep 12, 2021 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback Jan 31, 2023 Management Multi-Armed Bandits
— Unverified 0Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms Feb 27, 2023 Multi-Armed Bandits
— Unverified 0Improved High-Probability Regret for Adversarial Bandits with Time-Varying Feedback Graphs Oct 4, 2022 Multi-Armed Bandits
— Unverified 0Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing Feb 15, 2025 Multi-Armed Bandits
— Unverified 0A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit Nov 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Improved Regret Bounds for Linear Bandits with Heavy-Tailed Rewards Jun 5, 2025 Experimental Design Multi-Armed Bandits
— Unverified 0Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits Jun 1, 2016 Multi-Armed Bandits
— Unverified 0Improving Fairness in Adaptive Social Exergames via Shapley Bandits Feb 18, 2023 Fairness Multi-Armed Bandits
— Unverified 0Improving Offline Contextual Bandits with Distributional Robustness Nov 13, 2020 counterfactual Multi-Armed Bandits
— Unverified 0Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions Jun 16, 2024 Multi-Armed Bandits Policy Gradient Methods
— Unverified 0Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits Aug 28, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Incentivising Exploration and Recommendations for Contextual Bandits with Payments Jan 22, 2020 Multi-Armed Bandits
— Unverified 0Incentivized Exploration for Multi-Armed Bandits under Reward Drift Nov 12, 2019 Multi-Armed Bandits Thompson Sampling
— Unverified 0Incentivized Exploration via Filtered Posterior Sampling Feb 20, 2024 Multi-Armed Bandits
— Unverified 0A Closer Look at Small-loss Bounds for Bandits with Graph Feedback Feb 2, 2020 Multi-Armed Bandits
— Unverified 0Contextual Bandits with Sparse Data in Web setting May 6, 2021 Articles Dimensionality Reduction
— Unverified 0Instance-optimal PAC Algorithms for Contextual Bandits Jul 5, 2022 Multi-Armed Bandits
— Unverified 0Indexability and Rollout Policy for Multi-State Partially Observable Restless Bandits Jul 30, 2021 Multi-Armed Bandits Recommendation Systems
— Unverified 0From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses May 16, 2022 Multi-Armed Bandits
— Unverified 0Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits May 24, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0From Bandits to Experts: On the Value of Side-Observations Dec 1, 2011 Multi-Armed Bandits
— Unverified 0Individual Regret in Cooperative Stochastic Multi-Armed Bandits Nov 10, 2024 Multi-Armed Bandits
— Unverified 0In-Domain African Languages Translation Using LLMs and Multi-armed Bandits May 21, 2025 Domain Adaptation Machine Translation
— Unverified 0Inference for Batched Bandits Feb 8, 2020 Multi-Armed Bandits
— Unverified 0Contextual Causal Bayesian Optimisation Jan 29, 2023 Bayesian Optimisation Multi-Armed Bandits
— Unverified 0Confidence-Budget Matching for Sequential Budgeted Learning Feb 5, 2021 Decision Making Decision Making Under Uncertainty
— Unverified 0