A General Reduction for High-Probability Analysis with General Light-Tailed Distributions Mar 5, 2024 Multi-Armed Bandits Stochastic Optimization
— Unverified 00 Catoni Contextual Bandits are Robust to Heavy-tailed Rewards Feb 4, 2025 Multi-Armed Bandits
— Unverified 00 An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints Dec 11, 2024 Multi-Armed Bandits
— Unverified 00 ADARES: Adaptive Resource Management for Virtual Machines Dec 5, 2018 Management Multi-Armed Bandits
— Unverified 00 AdaLinUCB: Opportunistic Learning for Contextual Bandits Feb 20, 2019 Multi-Armed Bandits
— Unverified 00 Byzantine-Resilient Decentralized Multi-Armed Bandits Oct 11, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 00 An optimal learning method for developing personalized treatment regimes Jul 6, 2016 Clustering Multi-Armed Bandits
— Unverified 00 Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits Sep 2, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability Mar 28, 2020 Multi-Armed Bandits regression
— Unverified 00 An Optimal Algorithm for Multiplayer Multi-Armed Bandits Sep 28, 2019 Multi-Armed Bandits
— Unverified 00 Building Bridges: Viewing Active Learning from the Multi-Armed Bandit Lens Sep 26, 2013 Active Learning Binary Classification
— Unverified 00 Budgeted Recommendation with Delayed Feedback May 19, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits Jul 19, 2018 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Budgeted Combinatorial Multi-Armed Bandits Feb 8, 2022 Multi-Armed Bandits
— Unverified 00 An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays Oct 14, 2019 Multi-Armed Bandits
— Unverified 00 Adaptive, Robust and Scalable Bayesian Filtering for Online Learning May 12, 2025 Continual Learning Multi-Armed Bandits
— Unverified 00 Active Velocity Estimation using Light Curtains via Self-Supervised Multi-Armed Bandits Feb 24, 2023 Multi-Armed Bandits Navigate
— Unverified 00 Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling Feb 20, 2025 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Budget-Constrained Multi-Armed Bandits with Multiple Plays Nov 16, 2017 Multi-Armed Bandits
— Unverified 00 Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism Mar 22, 2021 Imitation Learning Multi-Armed Bandits
— Unverified 00 An Instrumental Value for Data Production and its Application to Data Pricing Dec 24, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Breaking the T Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits May 19, 2022 Multi-Armed Bandits parameter estimation
— Unverified 00 Breaking the (1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids Jan 29, 2025 Multi-Armed Bandits
— Unverified 00 An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit Nov 8, 2021 Multi-Armed Bandits
— Unverified 00 Adaptive Regret for Bandits Made Possible: Two Queries Suffice Jan 17, 2024 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 Bounded Regret for Finitely Parameterized Multi-Armed Bandits Mar 3, 2020 Multi-Armed Bandits
— Unverified 00 Boundary Crossing Probabilities for General Exponential Families May 24, 2017 Multi-Armed Bandits
— Unverified 00 An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits Oct 29, 2023 Multi-Armed Bandits
— Unverified 00 Bootstrapping Upper Confidence Bound Jun 12, 2019 Decision Making Multi-Armed Bandits
— Unverified 00 An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System Apr 4, 2025 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 Active Search for Sparse Signals with Region Sensing Dec 2, 2016 Bayesian Optimization Compressive Sensing
— Unverified 00 Boltzmann Exploration Done Right May 29, 2017 Decision Making Decision Making Under Uncertainty
— Unverified 00 BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits Jul 7, 2023 Decision Making Multi-Armed Bandits
— Unverified 00 BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits Feb 6, 2016 Multi-Armed Bandits
— Unverified 00 Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback Mar 15, 2025 Multi-Armed Bandits
— Unverified 00 A New Benchmark for Online Learning with Budget-Balancing Constraints Mar 19, 2025 Multi-Armed Bandits
— Unverified 00 Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles Feb 12, 2020 Multi-Armed Bandits regression
— Unverified 00 Beyond the Hazard Rate: More Perturbation Algorithms for Adversarial Multi-armed Bandits Feb 17, 2017 Multi-Armed Bandits
— Unverified 00 Better Algorithms for Stochastic Bandits with Adversarial Corruptions Feb 22, 2019 Multi-Armed Bandits
— Unverified 00 Best-of-Both-Worlds Linear Contextual Bandits Dec 27, 2023 Multi-Armed Bandits
— Unverified 00 A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free Feb 3, 2019 Multi-Armed Bandits
— Unverified 00 Adaptively Learning to Select-Rank in Online Platforms Jun 7, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling Dec 27, 2017 Multi-Armed Bandits Thompson Sampling
— Unverified 00 A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits Jun 10, 2021 Multi-Armed Bandits
— Unverified 00 A Batch Sequential Halving Algorithm without Performance Degradation Jun 1, 2024 Computational Efficiency Multi-Armed Bandits
— Unverified 00 Best-of-Both-Worlds Algorithms for Linear Contextual Bandits Dec 24, 2023 Multi-Armed Bandits
— Unverified 00 An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Best Arm Identification under Additive Transfer Bandits Dec 8, 2021 Multi-Armed Bandits Transfer Learning
— Unverified 00 Best Arm Identification in Stochastic Bandits: Beyond β-optimality Jan 10, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 An Empirical Evaluation of Federated Contextual Bandit Algorithms Mar 17, 2023 Federated Learning Multi-Armed Bandits
— Unverified 00