Query-Efficient Correlation Clustering with Noisy Oracle (Feb 2, 2024). Tags: Clustering, Multi-Armed Bandits. Status: Unverified.
Multi-Armed Bandits with Interference (Feb 2, 2024). Tags: Multi-Armed Bandits. Status: Unverified.
Falcon: Fair Active Learning using Multi-armed Bandits (Jan 23, 2024). Tags: Active Learning, Attribute. Status: Code Available.
Distributionally Robust Policy Evaluation under General Covariate Shift in Contextual Bandits (Jan 21, 2024). Tags: Multi-Armed Bandits, regression. Status: Code Available.
Distributed Multi-Task Learning for Stochastic Bandits with Context Distribution and Stage-wise Constraints (Jan 21, 2024). Tags: Multi-Armed Bandits, Multi-Task Learning. Status: Unverified.
Adaptive Regret for Bandits Made Possible: Two Queries Suffice (Jan 17, 2024). Tags: Hyperparameter Optimization, Multi-Armed Bandits. Status: Unverified.
On Quantum Natural Policy Gradients (Jan 16, 2024). Tags: Multi-Armed Bandits, reinforcement-learning. Status: Unverified.
Contextual Bandits with Stage-wise Constraints (Jan 15, 2024). Tags: Multi-Armed Bandits. Status: Unverified.
Let's Get It Started: Fostering the Discoverability of New Releases on Deezer (Jan 5, 2024). Tags: Multi-Armed Bandits. Status: Code Available.
Reliability-Optimized User Admission Control for URLLC Traffic: A Neural Contextual Bandit Approach (Jan 5, 2024). Tags: Multi-Armed Bandits. Status: Unverified.
Optimal cross-learning for contextual bandits with unknown context distributions (Jan 3, 2024). Tags: Multi-Armed Bandits. Status: Unverified.
Best-of-Both-Worlds Linear Contextual Bandits (Dec 27, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Foundations of Reinforcement Learning and Interactive Decision Making (Dec 27, 2023). Tags: Decision Making, Multi-Armed Bandits. Status: Unverified.
Harnessing the Power of Federated Learning in Federated Contextual Bandits (Dec 26, 2023). Tags: Decision Making, Federated Learning. Status: Code Available.
Diversity-Based Recruitment in Crowdsensing By Combinatorial Multi-Armed Bandits (Dec 25, 2023). Tags: Diversity, Multi-Armed Bandits. Status: Unverified.
Zero-Inflated Bandits (Dec 25, 2023). Tags: Multi-Armed Bandits, Thompson Sampling. Status: Unverified.
Best-of-Both-Worlds Algorithms for Linear Contextual Bandits (Dec 24, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Neural Contextual Bandits for Personalized Recommendation (Dec 21, 2023). Tags: Multi-Armed Bandits, Recommendation Systems. Status: Unverified.
Bayesian Analysis of Combinatorial Gaussian Process Bandits (Dec 20, 2023). Tags: Bayesian Inference, Informativeness. Status: Unverified.
Distribution-Dependent Rates for Multi-Distribution Learning (Dec 20, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Best Arm Identification with Fixed Budget: A Large Deviation Perspective (Dec 19, 2023). Tags: Multi-Armed Bandits. Status: Code Available.
Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration (Dec 19, 2023). Tags: Bayesian Inference, Decision Making. Status: Unverified.
Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints (Dec 16, 2023). Tags: Decision Making, Fairness. Status: Unverified.
Risk-Aware Continuous Control with Neural Contextual Bandits (Dec 15, 2023). Tags: continuous-control, Continuous Control. Status: Code Available.
A Hierarchical Nearest Neighbour Approach to Contextual Bandits (Dec 14, 2023). Tags: Computational Efficiency, Multi-Armed Bandits. Status: Unverified.
Robust and Performance Incentivizing Algorithms for Multi-Armed Bandits with Strategic Agents (Dec 13, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Contextual Bandits with Online Neural Regression (Dec 12, 2023). Tags: Multi-Armed Bandits, regression. Status: Unverified.
RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions (Dec 11, 2023). Tags: Multi-Armed Bandits, Off-policy evaluation. Status: Code Available.
Distributed Optimization via Kernelized Multi-armed Bandits (Dec 7, 2023). Tags: Decision Making, Distributed Optimization. Status: Unverified.
Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits (Dec 3, 2023). Tags: Causal Inference, Multi-Armed Bandits. Status: Code Available.
Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study (Nov 24, 2023). Tags: Decision Making, Multi-Armed Bandits. Status: Unverified.
When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective (Nov 23, 2023). Tags: Large Language Model, Multi-Armed Bandits. Status: Code Available.
Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks (Nov 22, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
An Experimental Design for Anytime-Valid Causal Inference on Multi-Armed Bandits (Nov 9, 2023). Tags: Causal Inference, Experimental Design. Status: Code Available.
Adversarial Attacks on Cooperative Multi-agent Bandits (Nov 3, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Efficient Generalized Low-Rank Tensor Contextual Bandits (Nov 3, 2023). Tags: Decision Making, Multi-Armed Bandits. Status: Unverified.
LLMs-augmented Contextual Bandit (Nov 3, 2023). Tags: Multi-Armed Bandits, reinforcement-learning. Status: Unverified.
High-dimensional Linear Bandits with Knapsacks (Nov 2, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Federated Linear Bandits with Finite Adversarial Actions (Nov 2, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits (Oct 29, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits (Oct 25, 2023). Tags: Multi-Armed Bandits. Status: Code Available.
A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits (Oct 24, 2023). Tags: Change Point Detection, Multi-Armed Bandits. Status: Unverified.
Off-Policy Evaluation for Large Action Spaces via Policy Convolution (Oct 24, 2023). Tags: Multi-Armed Bandits, Off-policy evaluation. Status: Unverified.
Contextual Bandits for Evaluating and Improving Inventory Control Policies (Oct 24, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Towards a Pretrained Model for Restless Bandits via Multi-arm Generalization (Oct 23, 2023). Tags: Multi-agent Reinforcement Learning, Multi-Armed Bandits. Status: Unverified.
α-Fair Contextual Bandits (Oct 22, 2023). Tags: Multi-Armed Bandits, Recommendation Systems. Status: Unverified.
Pure Exploration in Asynchronous Federated Bandits (Oct 17, 2023). Tags: Multi-Armed Bandits. Status: Unverified.
Leveraging heterogeneous spillover in maximizing contextual bandit rewards (Oct 16, 2023). Tags: Multi-Armed Bandits, Recommendation Systems. Status: Unverified.
Bad Values but Good Behavior: Learning Highly Misspecified Bandits and MDPs (Oct 13, 2023). Tags: Decision Making, Multi-Armed Bandits. Status: Unverified.
Byzantine-Resilient Decentralized Multi-Armed Bandits (Oct 11, 2023). Tags: Multi-Armed Bandits, Recommendation Systems. Status: Unverified.