Bandit Regret Scaling with the Effective Loss Range May 15, 2017 Multi-Armed Bandits
— Unverified 0Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Oct 13, 2021 Machine Translation Multi-Armed Bandits
— Unverified 0Bandits Don’t Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Nov 1, 2021 Machine Translation Multi-Armed Bandits
— Unverified 0Bandits for Learning to Explain from Explanations Feb 7, 2021 Gaussian Processes Multi-Armed Bandits
— Unverified 0Bandits meet Computer Architecture: Designing a Smartly-allocated Cache Jan 31, 2016 Multi-Armed Bandits
— Unverified 0Bandit Social Learning: Exploration under Myopic Behavior Feb 15, 2023 Multi-Armed Bandits
— Unverified 0Bandits Warm-up Cold Recommender Systems Jul 10, 2014 Multi-Armed Bandits Recommendation Systems
— Unverified 0Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms Jul 21, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 0Bandits with Knapsacks beyond the Worst Case Dec 1, 2021 Multi-Armed Bandits
— Unverified 0Bandits with Partially Observable Confounded Data Jun 11, 2020 Multi-Armed Bandits
— Unverified 0Bandits with Temporal Stochastic Constraints Nov 22, 2018 Multi-Armed Bandits
— Unverified 0Banker Online Mirror Descent Jun 16, 2021 Multi-Armed Bandits
— Unverified 0Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jan 25, 2023 Multi-Armed Bandits
— Unverified 0Batched Bandits with Crowd Externalities Sep 29, 2021 Multi-Armed Bandits
— Unverified 0Batched Coarse Ranking in Multi-Armed Bandits Dec 1, 2020 Multi-Armed Bandits
— Unverified 0Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits Oct 15, 2021 Multi-Armed Bandits
— Unverified 0Regret Bounds for Batched Bandits Oct 11, 2019 Multi-Armed Bandits
— Unverified 0Batched Nonparametric Bandits via k-Nearest Neighbor UCB May 15, 2025 Decision Making Marketing
— Unverified 0Breaking the (1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids Jan 29, 2025 Multi-Armed Bandits
— Unverified 0Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features Sep 13, 2024 Decision Making Fairness
— Unverified 0Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Batch Ensemble for Variance Dependent Regret in Stochastic Bandits Sep 13, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Towards Bayesian Data Selection Jun 18, 2024 Active Learning Additive models
— Unverified 0Bayesian decision-making under misspecified priors with applications to meta-learning Jul 3, 2021 Decision Making Meta-Learning
— Unverified 0An Analysis of Reinforcement Learning for Malaria Control Jul 19, 2021 Multi-Armed Bandits OpenAI Gym
— Unverified 0An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits Oct 8, 2017 Multi-Armed Bandits
— Unverified 0BEACON: Balancing Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes Jun 19, 2024 Multi-Armed Bandits Nutrition
— Unverified 0Beam Learning -- Using Machine Learning for Finding Beam Directions Jun 11, 2019 BIG-bench Machine Learning Multi-Armed Bandits
— Unverified 0Be Greedy in Multi-Armed Bandits Jan 4, 2021 Multi-Armed Bandits
— Unverified 0Efficient Prompt Optimization Through the Lens of Best Arm Identification Feb 15, 2024 Instruction Following Multi-Armed Bandits
— Unverified 0Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme Jun 11, 2020 Multi-Armed Bandits
— Unverified 0Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Aug 22, 2024 Language Modeling Language Modelling
— Unverified 0Best Arm Identification in Linked Bandits Nov 19, 2018 Multi-Armed Bandits
— Unverified 0A Gang of Bandits Jun 4, 2013 Clustering Multi-Armed Bandits
— Unverified 0Best Arm Identification in Restless Markov Multi-Armed Bandits Mar 29, 2022 Multi-Armed Bandits
— Unverified 0Best Arm Identification in Stochastic Bandits: Beyond β-optimality Jan 10, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 0Best Arm Identification under Additive Transfer Bandits Dec 8, 2021 Multi-Armed Bandits Transfer Learning
— Unverified 0An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 0Best-of-Both-Worlds Algorithms for Linear Contextual Bandits Dec 24, 2023 Multi-Armed Bandits
— Unverified 0Best-of-Both-Worlds Linear Contextual Bandits Dec 27, 2023 Multi-Armed Bandits
— Unverified 0Better Algorithms for Stochastic Bandits with Adversarial Corruptions Feb 22, 2019 Multi-Armed Bandits
— Unverified 0Beyond the Hazard Rate: More Perturbation Algorithms for Adversarial Multi-armed Bandits Feb 17, 2017 Multi-Armed Bandits
— Unverified 0Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles Feb 12, 2020 Multi-Armed Bandits regression
— Unverified 0Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback Mar 15, 2025 Multi-Armed Bandits
— Unverified 0BISTRO: An Efficient Relaxation-Based Method for Contextual Bandits Feb 6, 2016 Multi-Armed Bandits
— Unverified 0BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits Jul 7, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Boltzmann Exploration Done Right May 29, 2017 Decision Making Decision Making Under Uncertainty
— Unverified 0A framework for optimizing COVID-19 testing policy using a Multi Armed Bandit approach Jul 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Balanced Linear Contextual Bandits Dec 15, 2018 Causal Inference Multi-Armed Bandits
— Unverified 0