Individual Regret in Cooperative Stochastic Multi-Armed Bandits | Nov 10, 2024 | Multi-Armed Bandits | Unverified
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits | Nov 8, 2024 | Computational Efficiency, Multi-Armed Bandits | Code Available
Multi-armed Bandits with Missing Outcome | Nov 8, 2024 | Decision Making, Multi-Armed Bandits | Code Available
Structure Matters: Dynamic Policy Gradient | Nov 7, 2024 | Multi-Armed Bandits | Unverified
Sharp Analysis for KL-Regularized Contextual Bandits and RLHF | Nov 7, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL) | Unverified
Rising Rested Bandits: Lower Bounds and Efficient Algorithms | Nov 6, 2024 | Model Selection, Multi-Armed Bandits | Unverified
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset | Nov 6, 2024 | Continual Learning, Multi-Armed Bandits | Unverified
PageRank Bandits for Link Prediction | Nov 3, 2024 | Decision Making, Graph Learning | Code Available
MBExplainer: Multilevel bandit-based explanations for downstream models with augmented graph embeddings | Nov 1, 2024 | Graph Classification, Multi-Armed Bandits | Unverified
Minimum Empirical Divergence for Sub-Gaussian Linear Bandits | Oct 31, 2024 | Multi-Armed Bandits, Off-policy evaluation | Code Available
FedMABA: Towards Fair Federated Learning through Multi-Armed Bandits Allocation | Oct 26, 2024 | Fairness, Federated Learning | Unverified
Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints | Oct 24, 2024 | Fairness, Multi-Armed Bandits | Unverified
Optimal Streaming Algorithms for Multi-Armed Bandits | Oct 23, 2024 | Multi-Armed Bandits | Unverified
Reward Maximization for Pure Exploration: Minimax Optimal Good Arm Identification for Nonparametric Multi-Armed Bandits | Oct 21, 2024 | Multi-Armed Bandits, valid | Unverified
Online Learning for Function Placement in Serverless Computing | Oct 17, 2024 | Multi-Armed Bandits | Code Available
Contextual Bandits with Arm Request Costs and Delays | Oct 17, 2024 | Movie Recommendation, Multi-Armed Bandits | Unverified
Is Prior-Free Black-Box Non-Stationary Reinforcement Learning Feasible? | Oct 17, 2024 | Change Detection, Multi-Armed Bandits | Unverified
How Does Variance Shape the Regret in Contextual Bandits? | Oct 16, 2024 | Multi-Armed Bandits | Unverified
Comparative Performance of Collaborative Bandit Algorithms: Effect of Sparsity and Exploration Intensity | Oct 15, 2024 | Clustering, Multi-Armed Bandits | Unverified
Combinatorial Multi-armed Bandits: Arm Selection via Group Testing | Oct 14, 2024 | Multi-Armed Bandits, parameter estimation | Unverified
Contextual Bandits with Non-Stationary Correlated Rewards for User Association in MmWave Vehicular Networks | Oct 8, 2024 | Multi-Armed Bandits, Thompson Sampling | Unverified
Stochastic Bandits for Egalitarian Assignment | Oct 8, 2024 | Fairness, Multi-Armed Bandits | Unverified
Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits | Oct 8, 2024 | Change Detection, Multi-Armed Bandits | Unverified
EVOLvE: Evaluating and Optimizing LLMs For Exploration | Oct 8, 2024 | Decision Making Under Uncertainty, Multi-Armed Bandits | Unverified
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | Oct 7, 2024 | Multi-Armed Bandits, Sequential Decision Making | Unverified
High Probability Bound for Cross-Learning Contextual Bandits with Unknown Context Distributions | Oct 5, 2024 | Multi-Armed Bandits | Unverified
Minimax-optimal trust-aware multi-armed bandits | Oct 4, 2024 | Decision Making, Multi-Armed Bandits | Unverified
Online Posterior Sampling with a Diffusion Prior | Oct 4, 2024 | Multi-Armed Bandits | Unverified
uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs | Oct 4, 2024 | Multi-Armed Bandits, Scheduling | Unverified
On Lai's Upper Confidence Bound in Multi-Armed Bandits | Oct 3, 2024 | Multi-Armed Bandits | Unverified
Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits | Oct 2, 2024 | Multi-Armed Bandits, Multi-Task Learning | Unverified
Stabilizing the Kumaraswamy Distribution | Oct 1, 2024 | Link Prediction, Multi-Armed Bandits | Unverified
Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits | Sep 30, 2024 | Computational Efficiency, Multi-Armed Bandits | Unverified
Linear Contextual Bandits with Interference | Sep 24, 2024 | Causal Inference, Decision Making | Unverified
Second Order Bounds for Contextual Bandits with Function Approximation | Sep 24, 2024 | Multi-Armed Bandits | Unverified
Designing an Interpretable Interface for Contextual Bandits | Sep 23, 2024 | Multi-Armed Bandits, Off-policy evaluation | Unverified
Causal Feature Selection Method for Contextual Multi-Armed Bandits in Recommender System | Sep 20, 2024 | feature selection, Multi-Armed Bandits | Unverified
Partially Observable Contextual Bandits with Linear Payoffs | Sep 17, 2024 | Decision Making, Multi-Armed Bandits | Unverified
Batch Ensemble for Variance Dependent Regret in Stochastic Bandits | Sep 13, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL) | Unverified
Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features | Sep 13, 2024 | Decision Making, Fairness | Unverified
A Hybrid Meta-Learning and Multi-Armed Bandit Approach for Context-Specific Multi-Objective Recommendation Optimization | Sep 13, 2024 | Meta-Learning, Multi-Armed Bandits | Unverified
Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis | Sep 10, 2024 | Meta-Learning, Multi-Armed Bandits | Unverified
Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes | Sep 6, 2024 | Multi-Armed Bandits, Q-Learning | Unverified
Faster Q-Learning Algorithms for Restless Bandits | Sep 6, 2024 | Multi-Armed Bandits, Q-Learning | Unverified
Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design | Sep 2, 2024 | Event Detection, Multi-Armed Bandits | Code Available
Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits | Aug 28, 2024 | Multi-Armed Bandits, Thompson Sampling | Unverified
Representative Arm Identification: A fixed confidence approach to identify cluster representatives | Aug 26, 2024 | Multi-Armed Bandits | Unverified
Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications | Aug 26, 2024 | Multi-Armed Bandits, Thompson Sampling | Unverified
Online Fair Division with Contextual Bandits | Aug 23, 2024 | Fairness, Multi-Armed Bandits | Unverified
Dynamic Product Image Generation and Recommendation at Scale for Personalized E-commerce | Aug 22, 2024 | Image Generation, Multi-Armed Bandits | Unverified