A Novel Approach to Balance Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes and its Implementation in BEACON Dec 23, 2024 Multi-Armed Bandits Nutrition
— Unverified 0Balans: Multi-Armed Bandits-based Adaptive Large Neighborhood Search for Mixed-Integer Programming Problem Dec 18, 2024 Combinatorial Optimization Multi-Armed Bandits
Code Code Available 1Lagrangian Index Policy for Restless Bandits with Average Reward Dec 17, 2024 Multi-Armed Bandits reinforcement-learning
— Unverified 0MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization Dec 16, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health Dec 11, 2024 Multi-Armed Bandits
Code Code Available 0An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints Dec 11, 2024 Multi-Armed Bandits
— Unverified 0UCB algorithms for multi-armed bandits: Precise regret and adaptive inference Dec 9, 2024 Multi-Armed Bandits
— Unverified 0Conservative Contextual Bandits: Beyond Linear Representations Dec 9, 2024 Multi-Armed Bandits Sequential Decision Making
— Unverified 0Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi Dec 4, 2024 Decision Making Fairness
— Unverified 0Data Acquisition for Improving Model Fairness using Reinforcement Learning Dec 4, 2024 Data Valuation Fairness
— Unverified 0Selective Reviews of Bandit Problems in AI via a Statistical View Dec 3, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning at Adyen Nov 30, 2024 Multi-Armed Bandits regression
— Unverified 0Achieving PAC Guarantees in Mechanism Design through Multi-Armed Bandits Nov 30, 2024 Multi-Armed Bandits
— Unverified 0Off-policy estimation with adaptively collected data: the power of online learning Nov 19, 2024 Causal Inference Multi-Armed Bandits
— Unverified 0A unifying framework for generalised Bayesian online learning in non-stationary environments Nov 15, 2024 Continual Learning Multi-Armed Bandits
Code Code Available 1Multi-Agent Stochastic Bandits Robust to Adversarial Corruptions Nov 12, 2024 Multi-Armed Bandits
— Unverified 0Individual Regret in Cooperative Stochastic Multi-Armed Bandits Nov 10, 2024 Multi-Armed Bandits
— Unverified 0Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits Nov 8, 2024 Computational Efficiency Multi-Armed Bandits
Code Code Available 0Multi-armed Bandits with Missing Outcome Nov 8, 2024 Decision Making Multi-Armed Bandits
Code Code Available 0Structure Matters: Dynamic Policy Gradient Nov 7, 2024 Multi-Armed Bandits
— Unverified 0Sharp Analysis for KL-Regularized Contextual Bandits and RLHF Nov 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Rising Rested Bandits: Lower Bounds and Efficient Algorithms Nov 6, 2024 Model Selection Multi-Armed Bandits
— Unverified 0Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset Nov 6, 2024 Continual Learning Multi-Armed Bandits
— Unverified 0PageRank Bandits for Link Prediction Nov 3, 2024 Decision Making Graph Learning
Code Code Available 0MBExplainer: Multilevel bandit-based explanations for downstream models with augmented graph embeddings Nov 1, 2024 Graph Classification Multi-Armed Bandits
— Unverified 0Minimum Empirical Divergence for Sub-Gaussian Linear Bandits Oct 31, 2024 Multi-Armed Bandits Off-policy evaluation
Code Code Available 0FedMABA: Towards Fair Federated Learning through Multi-Armed Bandits Allocation Oct 26, 2024 Fairness Federated Learning
— Unverified 0Learning to Explore with Lagrangians for Bandits under Unknown Linear Constraints Oct 24, 2024 Fairness Multi-Armed Bandits
— Unverified 0Optimal Streaming Algorithms for Multi-Armed Bandits Oct 23, 2024 Multi-Armed Bandits
— Unverified 0Reward Maximization for Pure Exploration: Minimax Optimal Good Arm Identification for Nonparametric Multi-Armed Bandits Oct 21, 2024 Multi-Armed Bandits valid
— Unverified 0Contextual Bandits with Arm Request Costs and Delays Oct 17, 2024 Movie Recommendation Multi-Armed Bandits
— Unverified 0Online Learning for Function Placement in Serverless Computing Oct 17, 2024 Multi-Armed Bandits
Code Code Available 0Is Prior-Free Black-Box Non-Stationary Reinforcement Learning Feasible? Oct 17, 2024 Change Detection Multi-Armed Bandits
— Unverified 0How Does Variance Shape the Regret in Contextual Bandits? Oct 16, 2024 Multi-Armed Bandits
— Unverified 0Comparative Performance of Collaborative Bandit Algorithms: Effect of Sparsity and Exploration Intensity Oct 15, 2024 Clustering Multi-Armed Bandits
— Unverified 0Combinatorial Multi-armed Bandits: Arm Selection via Group Testing Oct 14, 2024 Multi-Armed Bandits parameter estimation
— Unverified 0EVOLvE: Evaluating and Optimizing LLMs For Exploration Oct 8, 2024 Decision Making Under Uncertainty Multi-Armed Bandits
— Unverified 0Stochastic Bandits for Egalitarian Assignment Oct 8, 2024 Fairness Multi-Armed Bandits
— Unverified 0Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits Oct 8, 2024 Change Detection Multi-Armed Bandits
— Unverified 0Contextual Bandits with Non-Stationary Correlated Rewards for User Association in MmWave Vehicular Networks Oct 8, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback Oct 7, 2024 Multi-Armed Bandits Sequential Decision Making
— Unverified 0High Probability Bound for Cross-Learning Contextual Bandits with Unknown Context Distributions Oct 5, 2024 Multi-Armed Bandits
— Unverified 0Online Posterior Sampling with a Diffusion Prior Oct 4, 2024 Multi-Armed Bandits
— Unverified 0Minimax-optimal trust-aware multi-armed bandits Oct 4, 2024 Decision Making Multi-Armed Bandits
— Unverified 0uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs Oct 4, 2024 Multi-Armed Bandits Scheduling
— Unverified 0On Lai's Upper Confidence Bound in Multi-Armed Bandits Oct 3, 2024 Multi-Armed Bandits
— Unverified 0Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits Oct 2, 2024 Multi-Armed Bandits Multi-Task Learning
— Unverified 0LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits Oct 2, 2024 Instruction Following Math
Code Code Available 1Stabilizing the Kumaraswamy Distribution Oct 1, 2024 Link Prediction Multi-Armed Bandits
— Unverified 0Optimism in the Face of Ambiguity Principle for Multi-Armed Bandits Sep 30, 2024 Computational Efficiency Multi-Armed Bandits
— Unverified 0