Leveraging (Biased) Information: Multi-armed Bandits with Offline Data May 4, 2024 Multi-Armed Bandits
— Unverified 0Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery May 3, 2024 Decision Making Interpretable Machine Learning
— Unverified 0Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback May 2, 2024 Multi-Armed Bandits Sequential Decision Making
— Unverified 0Recommenadation aided Caching using Combinatorial Multi-armed Bandits Apr 30, 2024 Multi-Armed Bandits
— Unverified 0Disentangling Exploration from Exploitation Apr 29, 2024 Disentanglement Multi-Armed Bandits
— Unverified 0Causally Abstracted Multi-armed Bandits Apr 26, 2024 Decision Making Multi-Armed Bandits
Code Code Available 0Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks Apr 25, 2024 Fairness Multi-Armed Bandits
— Unverified 0Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity Apr 10, 2024 Decision Making Meta Reinforcement Learning
Code Code Available 0Generalized Linear Bandits with Limited Adaptivity Apr 10, 2024 Multi-Armed Bandits
Code Code Available 0Feel-Good Thompson Sampling for Contextual Dueling Bandits Apr 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0On the Importance of Uncertainty in Decision-Making with Large Language Models Apr 3, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy Apr 2, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Nearly-tight Approximation Guarantees for the Improving Multi-Armed Bandits Problem Apr 1, 2024 Multi-Armed Bandits
— Unverified 0A Correction of Pseudo Log-Likelihood Method Mar 26, 2024 Multi-Armed Bandits
— Unverified 0Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making Mar 22, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Transfer in Sequential Multi-armed Bandits via Reward Samples Mar 19, 2024 Multi-Armed Bandits
— Unverified 0Phasic Diversity Optimization for Population-Based Reinforcement Learning Mar 17, 2024 Diversity MuJoCo
— Unverified 0Cramming Contextual Bandits for On-policy Statistical Evaluation Mar 11, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment Mar 11, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning Mar 8, 2024 Multi-Armed Bandits
— Unverified 0LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits Mar 5, 2024 Multi-Armed Bandits
— Unverified 0A General Reduction for High-Probability Analysis with General Light-Tailed Distributions Mar 5, 2024 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds Mar 1, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Investigating Gender Fairness in Machine Learning-driven Personalized Care for Chronic Pain Feb 29, 2024 Decision Making Fairness
— Unverified 0Federated Linear Contextual Bandits with Heterogeneous Clients Feb 29, 2024 All Federated Learning
— Unverified 0Batched Nonparametric Contextual Bandits Feb 27, 2024 Multi-Armed Bandits
— Unverified 0Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery Feb 24, 2024 Multi-Armed Bandits
Code Code Available 0Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement Feb 24, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Optimistic Information Directed Sampling Feb 23, 2024 Multi-Armed Bandits
— Unverified 0Multi-Armed Bandits with Abstention Feb 23, 2024 Decision Making Multi-Armed Bandits
— Unverified 0A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Feb 22, 2024 Language Modeling Language Modelling
— Unverified 0Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits Feb 21, 2024 Multi-Armed Bandits
— Unverified 0Incentivized Exploration via Filtered Posterior Sampling Feb 20, 2024 Multi-Armed Bandits
— Unverified 0Diffusion Models Meet Contextual Bandits with Large Action Spaces Feb 15, 2024 Efficient Exploration Multi-Armed Bandits
— Unverified 0Thompson Sampling in Partially Observable Contextual Bandits Feb 15, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0Efficient Prompt Optimization Through the Lens of Best Arm Identification Feb 15, 2024 Instruction Following Multi-Armed Bandits
— Unverified 0FLASH: Federated Learning Across Simultaneous Heterogeneities Feb 13, 2024 Federated Learning Multi-Armed Bandits
— Unverified 0Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits Feb 13, 2024 Multi-Armed Bandits
— Unverified 0Replicability is Asymptotically Free in Multi-armed Bandits Feb 12, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Multinomial Logit Bandits with General Value Functions Feb 12, 2024 Computational Efficiency Multi-Armed Bandits
— Unverified 0Efficient Contextual Bandits with Uninformed Feedback Graphs Feb 12, 2024 Multi-Armed Bandits regression
— Unverified 0Stochastic contextual bandits with graph feedback: from independence number to MAS number Feb 12, 2024 Multi-Armed Bandits
— Unverified 0More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning Feb 11, 2024 Distributional Reinforcement Learning Multi-Armed Bandits
— Unverified 0Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noise Feb 10, 2024 Multi-Armed Bandits
— Unverified 0Tree Ensembles for Contextual Bandits Feb 10, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Fairness of Exposure in Online Restless Multi-armed Bandits Feb 9, 2024 Fairness Multi-Armed Bandits
Code Code Available 0Simultaneously Achieving Group Exposure Fairness and Within-Group Meritocracy in Stochastic Bandits Feb 8, 2024 Attribute Exposure Fairness
Code Code Available 0Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits Feb 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Fairness and Privacy Guarantees in Federated Contextual Bandits Feb 5, 2024 Fairness Federated Learning
— Unverified 0Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction Feb 3, 2024 Marketing Multi-Armed Bandits
Code Code Available 0