Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits Feb 21, 2024 Multi-Armed Bandits
— Unverified 0Incentivized Exploration via Filtered Posterior Sampling Feb 20, 2024 Multi-Armed Bandits
— Unverified 0Thompson Sampling in Partially Observable Contextual Bandits Feb 15, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0Efficient Prompt Optimization Through the Lens of Best Arm Identification Feb 15, 2024 Instruction Following Multi-Armed Bandits
— Unverified 0Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 0FLASH: Federated Learning Across Simultaneous Heterogeneities Feb 13, 2024 Federated Learning Multi-Armed Bandits
— Unverified 0Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits Feb 13, 2024 Multi-Armed Bandits
— Unverified 0Stochastic contextual bandits with graph feedback: from independence number to MAS number Feb 12, 2024 Multi-Armed Bandits
— Unverified 0Contextual Multinomial Logit Bandits with General Value Functions Feb 12, 2024 Computational Efficiency Multi-Armed Bandits
— Unverified 0Efficient Contextual Bandits with Uninformed Feedback Graphs Feb 12, 2024 Multi-Armed Bandits regression
— Unverified 0Replicability is Asymptotically Free in Multi-armed Bandits Feb 12, 2024 Decision Making Multi-Armed Bandits
— Unverified 0More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning Feb 11, 2024 Distributional Reinforcement Learning Multi-Armed Bandits
— Unverified 0Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noise Feb 10, 2024 Multi-Armed Bandits
— Unverified 0Tree Ensembles for Contextual Bandits Feb 10, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Fairness of Exposure in Online Restless Multi-armed Bandits Feb 9, 2024 Fairness Multi-Armed Bandits
Code Code Available 0Simultaneously Achieving Group Exposure Fairness and Within-Group Meritocracy in Stochastic Bandits Feb 8, 2024 Attribute Exposure Fairness
Code Code Available 0Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits Feb 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Fairness and Privacy Guarantees in Federated Contextual Bandits Feb 5, 2024 Fairness Federated Learning
— Unverified 0Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction Feb 3, 2024 Marketing Multi-Armed Bandits
Code Code Available 0Multi-Armed Bandits with Interference Feb 2, 2024 Multi-Armed Bandits
— Unverified 0Query-Efficient Correlation Clustering with Noisy Oracle Feb 2, 2024 Clustering Multi-Armed Bandits
— Unverified 0Falcon: Fair Active Learning using Multi-armed Bandits Jan 23, 2024 Active Learning Attribute
Code Code Available 0Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits Jan 21, 2024 Multi-Armed Bandits regression
Code Code Available 0Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints Jan 21, 2024 Multi-Armed Bandits Multi-Task Learning
— Unverified 0Adaptive Regret for Bandits Made Possible: Two Queries Suffice Jan 17, 2024 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 0On Quantum Natural Policy Gradients Jan 16, 2024 Multi-Armed Bandits reinforcement-learning
— Unverified 0Contextual Bandits with Stage-wise Constraints Jan 15, 2024 Multi-Armed Bandits
— Unverified 0Let's Get It Started: Fostering the Discoverability of New Releases on Deezer Jan 5, 2024 Multi-Armed Bandits
Code Code Available 0Reliability-Optimized User Admission Control for URLLC Traffic: A Neural Contextual Bandit Approach Jan 5, 2024 Multi-Armed Bandits
— Unverified 0Optimal cross-learning for contextual bandits with unknown context distributions Jan 3, 2024 Multi-Armed Bandits
— Unverified 0Foundations of Reinforcement Learning and Interactive Decision Making Dec 27, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Best-of-Both-Worlds Linear Contextual Bandits Dec 27, 2023 Multi-Armed Bandits
— Unverified 0Harnessing the Power of Federated Learning in Federated Contextual Bandits Dec 26, 2023 Decision Making Federated Learning
Code Code Available 0Diversity-Based Recruitment in Crowdsensing By Combinatorial Multi-Armed Bandits Dec 25, 2023 Diversity Multi-Armed Bandits
— Unverified 0Zero-Inflated Bandits Dec 25, 2023 Multi-Armed Bandits Thompson Sampling
— Unverified 0Best-of-Both-Worlds Algorithms for Linear Contextual Bandits Dec 24, 2023 Multi-Armed Bandits
— Unverified 0Neural Contextual Bandits for Personalized Recommendation Dec 21, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 0In-Context Reinforcement Learning for Variable Action Spaces Dec 20, 2023 In-Context Reinforcement Learning Multi-Armed Bandits
Code Code Available 1Bayesian Analysis of Combinatorial Gaussian Process Bandits Dec 20, 2023 Bayesian Inference Informativeness
— Unverified 0Distribution-Dependent Rates for Multi-Distribution Learning Dec 20, 2023 Multi-Armed Bandits
— Unverified 0Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration Dec 19, 2023 Bayesian Inference Decision Making
— Unverified 0Best Arm Identification with Fixed Budget: A Large Deviation Perspective Dec 19, 2023 Multi-Armed Bandits
Code Code Available 0Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints Dec 16, 2023 Decision Making Fairness
— Unverified 0Risk-Aware Continuous Control with Neural Contextual Bandits Dec 15, 2023 continuous-control Continuous Control
Code Code Available 0A Hierarchical Nearest Neighbour Approach to Contextual Bandits Dec 14, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 0Robust and Performance Incentivizing Algorithms for Multi-Armed Bandits with Strategic Agents Dec 13, 2023 Multi-Armed Bandits
— Unverified 0Contextual Bandits with Online Neural Regression Dec 12, 2023 Multi-Armed Bandits regression
— Unverified 0RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions Dec 11, 2023 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0Distributed Optimization via Kernelized Multi-armed Bandits Dec 7, 2023 Decision Making Distributed Optimization
— Unverified 0Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits Dec 3, 2023 Causal Inference Multi-Armed Bandits
Code Code Available 0