On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits Mar 16, 2023 Multi-Armed Bandits
— Unverified 0Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling Mar 16, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Data Dependent Regret Guarantees Against General Comparators for Full or Bandit Feedback Mar 12, 2023 Multi-Armed Bandits
— Unverified 0Flooding with Absorption: An Efficient Protocol for Heterogeneous Bandits over Complex Networks Mar 9, 2023 Decision Making Multi-Armed Bandits
Code Code Available 0Queue Scheduling with Adversarial Bandit Learning Mar 3, 2023 Multi-Armed Bandits Scheduling
— Unverified 0Efficient Explorative Key-term Selection Strategies for Conversational Contextual Bandits Mar 1, 2023 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Fairness for Workers Who Pull the Arms: An Index Based Policy for Allocation of Restless Bandit Tasks Mar 1, 2023 Fairness Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Generalized Temporally-Partitioned Rewards Mar 1, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Approximately Stationary Bandits with Knapsacks Feb 28, 2023 Multi-Armed Bandits
— Unverified 0The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models Feb 28, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms Feb 27, 2023 Multi-Armed Bandits
— Unverified 0On Differentially Private Federated Linear Contextual Bandits Feb 27, 2023 Multi-Armed Bandits
— Unverified 0Kernel Conditional Moment Constraints for Confounding Robust Inference Feb 26, 2023 Multi-Armed Bandits Sensitivity
Code Code Available 0Active Velocity Estimation using Light Curtains via Self-Supervised Multi-Armed Bandits Feb 24, 2023 Multi-Armed Bandits Navigate
— Unverified 0Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments Feb 23, 2023 Multi-Armed Bandits regression
— Unverified 0Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency Feb 21, 2023 Computational Efficiency Decision Making
— Unverified 0A Blackbox Approach to Best of Both Worlds in Bandits and Beyond Feb 20, 2023 Multi-Armed Bandits
— Unverified 0Estimating Optimal Policy Value in General Linear Contextual Bandits Feb 19, 2023 Model Selection Multi-Armed Bandits
— Unverified 0Stochastic Approximation Approaches to Group Distributionally Robust Optimization and Beyond Feb 18, 2023 Multi-Armed Bandits
— Unverified 0Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits Feb 18, 2023 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 0Improving Fairness in Adaptive Social Exergames via Shapley Bandits Feb 18, 2023 Fairness Multi-Armed Bandits
— Unverified 0Practical Contextual Bandits with Feedback Graphs Feb 17, 2023 Multi-Armed Bandits regression
— Unverified 0Infinite Action Contextual Bandits with Reusable Data Exhaust Feb 16, 2023 Model Selection Multi-Armed Bandits
Code Code Available 0Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation Feb 15, 2023 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Bandit Social Learning: Exploration under Myopic Behavior Feb 15, 2023 Multi-Armed Bandits
— Unverified 0Adversarial Rewards in Universal Learning for Contextual Bandits Feb 14, 2023 Multi-Armed Bandits
— Unverified 0Piecewise-Stationary Multi-Objective Multi-Armed Bandit with Application to Joint Communications and Sensing Feb 10, 2023 Change Detection Multi-Armed Bandits
Code Code Available 0Leveraging User-Triggered Supervision in Contextual Bandits Feb 7, 2023 Multi-Armed Bandits
— Unverified 0On Private and Robust Bandits Feb 6, 2023 Multi-Armed Bandits
— Unverified 0Multiplier Bootstrap-based Exploration Feb 3, 2023 Multi-Armed Bandits
— Unverified 0Randomized Greedy Learning for Non-monotone Stochastic Submodular Maximization Under Full-bandit Feedback Feb 2, 2023 Multi-Armed Bandits
— Unverified 0Stochastic Contextual Bandits with Long Horizon Rewards Feb 2, 2023 Decision Making Language Modeling
— Unverified 0Quantum contextual bandits and recommender systems for quantum data Jan 31, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 0Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback Jan 31, 2023 Management Multi-Armed Bandits
— Unverified 0Adversarial Attacks on Adversarial Bandits Jan 30, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 0A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback Jan 30, 2023 Multi-Armed Bandits
— Unverified 0Contextual Causal Bayesian Optimisation Jan 29, 2023 Bayesian Optimisation Multi-Armed Bandits
— Unverified 0Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits Jan 26, 2023 Multi-agent Reinforcement Learning Multi-Armed Bandits
— Unverified 0Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jan 25, 2023 Multi-Armed Bandits
— Unverified 0Quantum Heavy-tailed Bandits Jan 23, 2023 Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits and Quantum Channel Oracles Jan 20, 2023 Multi-Armed Bandits reinforcement-learning
— Unverified 0Multi-armed Bandit Learning for TDMA Transmission Slot Scheduling and Defragmentation for Improved Bandwidth Usage Jan 14, 2023 Multi-Armed Bandits Scheduling
— Unverified 0Best Arm Identification in Stochastic Bandits: Beyond β-optimality Jan 10, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 0Local Differential Privacy for Sequential Decision Making in a Changing Environment Jan 2, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Bandits and Optimistically Universal Learning Dec 31, 2022 Multi-Armed Bandits
— Unverified 0Online Statistical Inference for Contextual Bandits via Stochastic Gradient Descent Dec 30, 2022 Decision Making Multi-Armed Bandits
— Unverified 0On the Complexity of Representation Learning in Contextual Linear Bandits Dec 19, 2022 Model Selection Multi-Armed Bandits
— Unverified 0MABSplit: Faster Forest Training Using Multi-Armed Bandits Dec 14, 2022 Feature Importance Multi-Armed Bandits
Code Code Available 0Faster Maximum Inner Product Search in High Dimensions Dec 14, 2022 Multi-Armed Bandits Recommendation Systems
— Unverified 0Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes Dec 12, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0