Upper Counterfactual Confidence Bounds: a New Optimism Principle for Contextual Bandits | Jul 15, 2020 | counterfactual, Multi-Armed Bandits
Value Directed Exploration in Multi-Armed Bandits with Structured Priors | Apr 12, 2017 | Multi-Armed Bandits
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency | Feb 21, 2023 | Computational Efficiency, Decision Making
Variance-Dependent Regret Lower Bounds for Contextual Bandits | Mar 15, 2025 | Multi-Armed Bandits
Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits | Feb 3, 2022 | counterfactual, Multi-Armed Bandits
Variational Inference for Model-Free and Model-Based Reinforcement Learning | Sep 4, 2022 | Bayesian Inference, Bayesian Optimization
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences | Feb 14, 2022 | Multi-Armed Bandits
Vertical Federated Linear Contextual Bandits | Oct 20, 2022 | Multi-Armed Bandits
Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits | Sep 15, 2023 | Multi-Armed Bandits, Off-policy evaluation
Bandit algorithms to emulate human decision making using probabilistic distortions | Nov 30, 2016 | Decision Making, Multi-Armed Bandits
What Doubling Tricks Can and Can't Do for Multi-Armed Bandits | Mar 19, 2018 | Multi-Armed Bandits, Reinforcement Learning
Bad Values but Good Behavior: Learning Highly Misspecified Bandits and MDPs | Oct 13, 2023 | Decision Making, Multi-Armed Bandits
When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits | Sep 6, 2022 | Multi-Armed Bandits
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes | Sep 6, 2024 | Multi-Armed Bandits, Q-Learning
Why so gloomy? A Bayesian explanation of human pessimism bias in the multi-armed bandit task | Dec 1, 2018 | Multi-Armed Bandits, Reinforcement Learning
Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations | Apr 10, 2022 | Decision Making, Decision Making Under Uncertainty
You Can Trade Your Experience in Distributed Multi-Agent Multi-Armed Bandits | Jun 19, 2023 | Decision Making, Multi-Armed Bandits
A Survey on Practical Applications of Multi-Armed and Contextual Bandits | Apr 2, 2019 | Information Retrieval, Multi-Armed Bandits
Zero-Inflated Bandits | Dec 25, 2023 | Multi-Armed Bandits, Thompson Sampling
Functional multi-armed bandit and the best function identification problems | Mar 1, 2025 | Multi-Armed Bandits
A Bandit Approach to Sequential Experimental Design with False Discovery Control | Dec 1, 2018 | Drug Discovery, Experimental Design
A Batch Sequential Halving Algorithm without Performance Degradation | Jun 1, 2024 | Computational Efficiency, Multi-Armed Bandits
Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits | Feb 7, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL)
A Blackbox Approach to Best of Both Worlds in Bandits and Beyond | Feb 20, 2023 | Multi-Armed Bandits
Access Probability Optimization in RACH: A Multi-Armed Bandits Approach | Apr 18, 2025 | Multi-Armed Bandits
Accurate and Fast Federated Learning via Combinatorial Multi-Armed Bandits | Dec 6, 2020 | BIG-bench Machine Learning, Federated Learning
A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits | Jun 10, 2021 | Multi-Armed Bandits
Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling | Feb 20, 2025 | Multi-Armed Bandits, Thompson Sampling
Achieving User-Side Fairness in Contextual Bandits | Oct 22, 2020 | Fairness, Multi-Armed Bandits
A Classification View on Meta Learning Bandits | Apr 6, 2025 | Classification, Meta-Learning
A Closer Look at Small-loss Bounds for Bandits with Graph Feedback | Feb 2, 2020 | Multi-Armed Bandits
A Contextual Combinatorial Bandit Approach to Negotiation | Jun 30, 2024 | Multi-Armed Bandits
A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification | Jun 16, 2022 | Multi-Armed Bandits, Thompson Sampling
A Correction of Pseudo Log-Likelihood Method | Mar 26, 2024 | Multi-Armed Bandits
Active Inference for Autonomous Decision-Making with Contextual Multi-Armed Bandits | Sep 19, 2022 | Decision Making, Decision Making Under Uncertainty
Active Reinforcement Learning: Observing Rewards at a Cost | Nov 13, 2020 | Multi-Armed Bandits, reinforcement-learning
Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling | Dec 27, 2017 | Multi-Armed Bandits, Thompson Sampling
Active Search for Sparse Signals with Region Sensing | Dec 2, 2016 | Bayesian Optimization, Compressive Sensing
Active Velocity Estimation using Light Curtains via Self-Supervised Multi-Armed Bandits | Feb 24, 2023 | Multi-Armed Bandits, Navigate
AdaLinUCB: Opportunistic Learning for Contextual Bandits | Feb 20, 2019 | Multi-Armed Bandits
AdaptEx: A Self-Service Contextual Bandit Platform | Aug 8, 2023 | Multi-Armed Bandits, Thompson Sampling
Adapting Bandit Algorithms for Settings with Sequentially Available Arms | Sep 30, 2021 | Management, Multi-Armed Bandits
Adapting to Delays and Data in Adversarial Multi-Armed Bandits | Oct 12, 2020 | Multi-Armed Bandits
Adapting to Misspecification in Contextual Bandits with Offline Regression Oracles | Feb 26, 2021 | Multi-Armed Bandits, regression
Adapting to Misspecification in Contextual Bandits | Jul 12, 2021 | Multi-Armed Bandits, regression
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits | Jan 28, 2022 | Multi-Armed Bandits
Adaptive Budgeted Multi-Armed Bandits for IoT with Dynamic Resource Constraints | May 5, 2025 | Multi-Armed Bandits
Adaptive Contract Design for Crowdsourcing Markets: Bandit Algorithms for Repeated Principal-Agent Problems | May 12, 2014 | Multi-Armed Bandits
Adaptive Data Augmentation for Thompson Sampling | Jun 17, 2025 | Data Augmentation, Multi-Armed Bandits
Adaptive Discretization against an Adversary: Lipschitz bandits, Dynamic Pricing, and Auction Tuning | Jun 22, 2020 | Multi-Armed Bandits