The Best Arm Evades: Near-optimal Multi-pass Streaming Lower Bounds for Pure Exploration in Multi-armed Bandits Sep 6, 2023 Multi-Armed Bandits
Are sample means in multi-armed bandits positively or negatively biased? May 27, 2019 Multi-Armed Bandits Selection bias
Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
The Epoch-Greedy Algorithm for Multi-armed Bandits with Side Information Dec 1, 2007 Multi-Armed Bandits
The Externalities of Exploration and How Data Diversity Helps Exploitation Jun 1, 2018 Diversity Multi-Armed Bandits
The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates Mar 1, 2018 Multi-Armed Bandits
The Pareto Frontier of Instance-Dependent Guarantees in Multi-Player Multi-Armed Bandits with no Communication Feb 19, 2022 Multi-Armed Bandits
The Pareto Frontier of model selection for general Contextual Bandits Oct 25, 2021 Model Selection Multi-Armed Bandits
The Price of Differential Privacy For Online Learning Jan 27, 2017 Multi-Armed Bandits
Thompson Sampling for Budgeted Multi-armed Bandits May 1, 2015 Multi-Armed Bandits Thompson Sampling
Thompson Sampling Algorithms for Cascading Bandits Oct 2, 2018 Efficient Exploration Multi-Armed Bandits
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints Nov 2, 2019 Bayesian Optimization Decision Making
Thompson sampling for improved exploration in GFlowNets Jun 30, 2023 Active Learning Decision Making
Thompson Sampling for Unsupervised Sequential Selection Sep 16, 2020 Multi-Armed Bandits Thompson Sampling
Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study Nov 24, 2023 Decision Making Multi-Armed Bandits
Thompson Sampling in Partially Observable Contextual Bandits Feb 15, 2024 Decision Making Decision Making Under Uncertainty
Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards Apr 26, 2023 Multi-Armed Bandits Thompson Sampling
Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits Feb 13, 2024 Multi-Armed Bandits
Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits Mar 4, 2025 Multi-Armed Bandits
Tight Lower Bounds for Combinatorial Multi-Armed Bandits Feb 13, 2020 Decision Making Multi-Armed Bandits
Tight Regret Bounds for Infinite-armed Linear Contextual Bandits May 4, 2019 Decision Making Multi-Armed Bandits
Top-K Ranking Deep Contextual Bandits for Information Selection Systems Jan 28, 2022 Multi-Armed Bandits
To update or not to update? Delayed Nonparametric Bandits with Randomized Allocation May 26, 2020 Multi-Armed Bandits
Towards Distribution-Free Multi-Armed Bandits with Combinatorial Strategies Jul 20, 2013 Multi-Armed Bandits
Towards Domain Adaptive Neural Contextual Bandits Jun 13, 2024 Decision Making Domain Adaptation
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making Apr 12, 2025 Decision Making Decision Making Under Uncertainty
Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information Mar 24, 2021 Multi-Armed Bandits
Towards Robust Off-Policy Evaluation via Human Inputs Sep 18, 2022 Multi-Armed Bandits Off-policy evaluation
Towards Soft Fairness in Restless Multi-Armed Bandits Jul 27, 2022 Fairness Multi-Armed Bandits
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process Mar 1, 2025 Multi-Armed Bandits Reinforcement Learning (RL)
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization Oct 23, 2023 Multi-agent Reinforcement Learning Multi-Armed Bandits
Tracking Most Significant Shifts in Nonparametric Contextual Bandits Jul 11, 2023 Multi-Armed Bandits
Tractable contextual bandits beyond realizability Oct 25, 2020 Multi-Armed Bandits
Transfer in Sequential Multi-armed Bandits via Reward Samples Mar 19, 2024 Multi-Armed Bandits
Transfer Learning for Contextual Multi-armed Bandits Nov 22, 2022 Multi-Armed Bandits Transfer Learning
Transfer Learning in Bandits with Latent Continuity Feb 4, 2021 Multi-Armed Bandits Transfer Learning
Tree Ensembles for Contextual Bandits Feb 10, 2024 Multi-Armed Bandits Thompson Sampling
Trend Detection based Regret Minimization for Bandit Problems Sep 15, 2017 Multi-Armed Bandits
Trend-responsive User Segmentation Enabling Traceable Publishing Insights. A Case Study of a Real-world Large-scale News Recommendation System Oct 28, 2019 Diversity global-optimization
Triply Robust Off-Policy Evaluation Nov 13, 2019 Multi-Armed Bandits Off-policy evaluation
TS-UCB: Improving on Thompson Sampling With Little to No Additional Computation Jun 11, 2020 Multi-Armed Bandits Thompson Sampling
UCB algorithms for multi-armed bandits: Precise regret and adaptive inference Dec 9, 2024 Multi-Armed Bandits
Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits May 30, 2024 Multi-Armed Bandits
Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits Aug 11, 2022 Decision Making Decision Making Under Uncertainty
Unifying Clustered and Non-stationary Bandits Sep 5, 2020 Change Detection Clustering
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs Oct 4, 2024 Multi-Armed Bandits Scheduling
Unimodal Bandits: Regret Lower Bounds and Optimal Algorithms May 20, 2014 Multi-Armed Bandits
Universal and data-adaptive algorithms for model selection in linear contextual bandits Nov 8, 2021 Diversity Model Selection
Unreliable Multi-Armed Bandits: A Novel Approach to Recommendation Systems Nov 14, 2019 Multi-Armed Bandits Recommendation Systems
Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits Jul 16, 2015 Active Learning Multi-Armed Bandits