Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jan 25, 2023 Multi-Armed Bandits
— Unverified 00 Batched Bandits with Crowd Externalities Sep 29, 2021 Multi-Armed Bandits
— Unverified 00 Batched Coarse Ranking in Multi-Armed Bandits Dec 1, 2020 Multi-Armed Bandits
— Unverified 00 Regret Bounds for Batched Bandits Oct 11, 2019 Multi-Armed Bandits
— Unverified 00 Batched Nonparametric Bandits via k-Nearest Neighbor UCB May 15, 2025 Decision Making Marketing
— Unverified 00 Batched Nonparametric Contextual Bandits Feb 27, 2024 Multi-Armed Bandits
— Unverified 00 Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features Sep 13, 2024 Decision Making Fairness
— Unverified 00 Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Batch Ensemble for Variance Dependent Regret in Stochastic Bandits Sep 13, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 00 Towards Bayesian Data Selection Jun 18, 2024 Active Learning Additive models
— Unverified 00 Bayesian decision-making under misspecified priors with applications to meta-learning Jul 3, 2021 Decision Making Meta-Learning
— Unverified 00 BEACON: Balancing Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes Jun 19, 2024 Multi-Armed Bandits Nutrition
— Unverified 00 Beam Learning -- Using Machine Learning for Finding Beam Directions Jun 11, 2019 BIG-bench Machine Learning Multi-Armed Bandits
— Unverified 00 Be Greedy in Multi-Armed Bandits Jan 4, 2021 Multi-Armed Bandits
— Unverified 00 Efficient Prompt Optimization Through the Lens of Best Arm Identification Feb 15, 2024 Instruction Following Multi-Armed Bandits
— Unverified 00 Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme Jun 11, 2020 Multi-Armed Bandits
— Unverified 00 Best-Arm Identification in Correlated Multi-Armed Bandits Sep 10, 2021 Multi-Armed Bandits
— Unverified 00 Best Arm Identification in Linked Bandits Nov 19, 2018 Multi-Armed Bandits
— Unverified 00 Best arm identification in multi-armed bandits with delayed feedback Mar 29, 2018 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 Best Arm Identification in Restless Markov Multi-Armed Bandits Mar 29, 2022 Multi-Armed Bandits
— Unverified 00 Best Arm Identification in Stochastic Bandits: Beyond β-optimality Jan 10, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 Best Arm Identification under Additive Transfer Bandits Dec 8, 2021 Multi-Armed Bandits Transfer Learning
— Unverified 00 Best-of-Both-Worlds Algorithms for Linear Contextual Bandits Dec 24, 2023 Multi-Armed Bandits
— Unverified 00 Best-of-Both-Worlds Linear Contextual Bandits Dec 27, 2023 Multi-Armed Bandits
— Unverified 00 Better Algorithms for Stochastic Bandits with Adversarial Corruptions Feb 22, 2019 Multi-Armed Bandits
— Unverified 00 Beyond the Hazard Rate: More Perturbation Algorithms for Adversarial Multi-armed Bandits Feb 17, 2017 Multi-Armed Bandits
— Unverified 00 Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles Feb 12, 2020 Multi-Armed Bandits regression
— Unverified 00 Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback Mar 15, 2025 Multi-Armed Bandits
— Unverified 00 BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits Feb 6, 2016 Multi-Armed Bandits
— Unverified 00 BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits Jul 7, 2023 Decision Making Multi-Armed Bandits
— Unverified 00 Boltzmann Exploration Done Right May 29, 2017 Decision Making Decision Making Under Uncertainty
— Unverified 00 Bootstrapping Upper Confidence Bound Jun 12, 2019 Decision Making Multi-Armed Bandits
— Unverified 00 Boundary Crossing Probabilities for General Exponential Families May 24, 2017 Multi-Armed Bandits
— Unverified 00 Bounded Regret for Finitely Parameterized Multi-Armed Bandits Mar 3, 2020 Multi-Armed Bandits
— Unverified 00 Breaking the (1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids Jan 29, 2025 Multi-Armed Bandits
— Unverified 00 Breaking the T Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits May 19, 2022 Multi-Armed Bandits parameter estimation
— Unverified 00 Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism Mar 22, 2021 Imitation Learning Multi-Armed Bandits
— Unverified 00 Budget-Constrained Multi-Armed Bandits with Multiple Plays Nov 16, 2017 Multi-Armed Bandits
— Unverified 00 Budgeted Combinatorial Multi-Armed Bandits Feb 8, 2022 Multi-Armed Bandits
— Unverified 00 Budgeted Recommendation with Delayed Feedback May 19, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Building Bridges: Viewing Active Learning from the Multi-Armed Bandit Lens Sep 26, 2013 Active Learning Binary Classification
— Unverified 00 Bypassing the Monster: A Faster and Simpler Optimal Algorithm for Contextual Bandits under Realizability Mar 28, 2020 Multi-Armed Bandits regression
— Unverified 00 Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits Sep 2, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 Byzantine-Resilient Decentralized Multi-Armed Bandits Oct 11, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Catoni Contextual Bandits are Robust to Heavy-tailed Rewards Feb 4, 2025 Multi-Armed Bandits
— Unverified 00 Causal Bandits: Online Decision-Making in Endogenous Settings Nov 16, 2022 Decision Making Multi-Armed Bandits
— Unverified 00 Causal Contextual Bandits with Targeted Interventions Sep 29, 2021 Multi-Armed Bandits
— Unverified 00 Causal Feature Selection Method for Contextual Multi-Armed Bandits in Recommender System Sep 20, 2024 feature selection Multi-Armed Bandits
— Unverified 00 Censored Semi-Bandits for Resource Allocation Apr 12, 2021 Multi-Armed Bandits
— Unverified 00