Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation Jan 10, 2025 Multi-Armed Bandits
— Unverified 00 Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy Jan 24, 2025 Decision Making Multi-Armed Bandits
— Unverified 00 Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health Feb 2, 2022 Multi-Armed Bandits Scheduling
— Unverified 00 Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation Oct 3, 2023 Multi-Armed Bandits Q-Learning
— Unverified 00 Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 First- and Second-Order Bounds for Adversarial Linear Contextual Bandits May 1, 2023 Multi-Armed Bandits
— Unverified 00 Fixed-Budget Best-Arm Identification in Structured Bandits Jun 9, 2021 Multi-Armed Bandits
— Unverified 00 FLASH: Federated Learning Across Simultaneous Heterogeneities Feb 13, 2024 Federated Learning Multi-Armed Bandits
— Unverified 00 Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles Mar 30, 2022 Decision Making Heterogeneous Treatment Effect Estimation
— Unverified 00 Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts Sep 25, 2023 LEMMA Multi-Armed Bandits
— Unverified 00 Decision Automation for Electric Power Network Recovery Oct 1, 2019 Decision Making Multi-Armed Bandits
— Unverified 00 Decentralized Smart Charging of Large-Scale EVs using Adaptive Multi-Agent Multi-Armed Bandits Jul 20, 2023 Fairness Multi-Armed Bandits
— Unverified 00 Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 00 An Adaptive Method for Contextual Stochastic Multi-armed Bandits with Rewards Generated by a Linear Dynamical System Jun 14, 2024 Multi-Armed Bandits
— Unverified 00 Decentralized Multi-player Multi-armed Bandits with No Collision Information Feb 29, 2020 Multi-Armed Bandits
— Unverified 00 Decentralized Upper Confidence Bound Algorithms for Homogeneous Multi-Agent Multi-Armed Bandits Nov 22, 2021 Multi-Armed Bandits
— Unverified 00 Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features Sep 13, 2024 Decision Making Fairness
— Unverified 00 Decentralized Exploration in Multi-Armed Bandits -- Extended version Nov 19, 2018 Multi-Armed Bandits
— Unverified 00 Batched Nonparametric Contextual Bandits Feb 27, 2024 Multi-Armed Bandits
— Unverified 00 Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure Nov 1, 2021 Multi-agent Reinforcement Learning Multi-Armed Bandits
— Unverified 00 DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees Oct 19, 2020 Attribute Decision Making
— Unverified 00 Data Poisoning Attacks on Stochastic Bandits May 16, 2019 Data Poisoning Multi-Armed Bandits
— Unverified 00 Batched Nonparametric Bandits via k-Nearest Neighbor UCB May 15, 2025 Decision Making Marketing
— Unverified 00 Regret Bounds for Batched Bandits Oct 11, 2019 Multi-Armed Bandits
— Unverified 00 A Model Selection Approach for Corruption Robust Reinforcement Learning Oct 7, 2021 Model Selection Multi-Armed Bandits
— Unverified 00 Data Poisoning Attacks in Contextual Bandits Aug 17, 2018 Data Poisoning Multi-Armed Bandits
— Unverified 00 Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits Jun 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Data Dependent Regret Guarantees Against General Comparators for Full or Bandit Feedback Mar 12, 2023 Multi-Armed Bandits
— Unverified 00 Data Acquisition for Improving Model Fairness using Reinforcement Learning Dec 4, 2024 Data Valuation Fairness
— Unverified 00 Batched Coarse Ranking in Multi-Armed Bandits Dec 1, 2020 Multi-Armed Bandits
— Unverified 00 Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits Oct 15, 2021 Multi-Armed Bandits
— Unverified 00 Query-Reward Tradeoffs in Multi-Armed Bandits Oct 12, 2021 Multi-Armed Bandits
— Unverified 00 Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models Nov 22, 2017 Multi-Armed Bandits Response Generation
— Unverified 00 Batched Bandits with Crowd Externalities Sep 29, 2021 Multi-Armed Bandits
— Unverified 00 Cost-Aware Optimal Pairwise Pure Exploration Mar 10, 2025 Multi-Armed Bandits
— Unverified 00 Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jan 25, 2023 Multi-Armed Bandits
— Unverified 00 Adaptive Endpointing with Deep Contextual Multi-armed Bandits Mar 23, 2023 Multi-Armed Bandits
— Unverified 00 Corruption-robust exploration in episodic reinforcement learning Nov 20, 2019 Multi-Armed Bandits reinforcement-learning
— Unverified 00 Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes Dec 12, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 00 Banker Online Mirror Descent Jun 16, 2021 Multi-Armed Bandits
— Unverified 00 Bandits with Temporal Stochastic Constraints Nov 22, 2018 Multi-Armed Bandits
— Unverified 00 Almost Boltzmann Exploration Jan 25, 2019 Multi-Armed Bandits Reinforcement Learning
— Unverified 00 CorrAttack: Black-box Adversarial Attack with Structured Search Oct 3, 2020 Adversarial Attack Bayesian Optimization
— Unverified 00 Bandits with Partially Observable Confounded Data Jun 11, 2020 Multi-Armed Bandits
— Unverified 00 Coordination without communication: optimal regret in two players multi-armed bandits Feb 14, 2020 Multi-Armed Bandits Vocal Bursts Valence Prediction
— Unverified 00 Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi Dec 4, 2024 Decision Making Fairness
— Unverified 00 Bandits with Knapsacks beyond the Worst Case Dec 1, 2021 Multi-Armed Bandits
— Unverified 00 Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits Apr 27, 2015 Multi-Armed Bandits
— Unverified 00 Adaptive Discretization against an Adversary: Lipschitz bandits, Dynamic Pricing, and Auction Tuning Jun 22, 2020 Multi-Armed Bandits
— Unverified 00 A Correction of Pseudo Log-Likelihood Method Mar 26, 2024 Multi-Armed Bandits
— Unverified 00