Linear Contextual Bandits with Interference Sep 24, 2024 Causal Inference Decision Making
— Unverified 0Second Order Bounds for Contextual Bandits with Function Approximation Sep 24, 2024 Multi-Armed Bandits
— Unverified 0Designing an Interpretable Interface for Contextual Bandits Sep 23, 2024 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Causal Feature Selection Method for Contextual Multi-Armed Bandits in Recommender System Sep 20, 2024 feature selection Multi-Armed Bandits
— Unverified 0Partially Observable Contextual Bandits with Linear Payoffs Sep 17, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Batched Online Contextual Sparse Bandits with Sequential Inclusion of Features Sep 13, 2024 Decision Making Fairness
— Unverified 0Batch Ensemble for Variance Dependent Regret in Stochastic Bandits Sep 13, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0A Hybrid Meta-Learning and Multi-Armed Bandit Approach for Context-Specific Multi-Objective Recommendation Optimization Sep 13, 2024 Meta-Learning Multi-Armed Bandits
— Unverified 0Modified Meta-Thompson Sampling for Linear Bandits and Its Bayes Regret Analysis Sep 10, 2024 Meta-Learning Multi-Armed Bandits
— Unverified 0Whittle Index Learning Algorithms for Restless Bandits with Constant Stepsizes Sep 6, 2024 Multi-Armed Bandits Q-Learning
— Unverified 0Faster Q-Learning Algorithms for Restless Bandits Sep 6, 2024 Multi-Armed Bandits Q-Learning
— Unverified 0Performance-Aware Self-Configurable Multi-Agent Networks: A Distributed Submodular Approach for Simultaneous Coordination and Network Design Sep 2, 2024 Event Detection Multi-Armed Bandits
Code Code Available 0Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits Aug 28, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Contextual Bandit with Herding Effects: Algorithms and Recommendation Applications Aug 26, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0Representative Arm Identification: A fixed confidence approach to identify cluster representatives Aug 26, 2024 Multi-Armed Bandits
— Unverified 0Online Fair Division with Contextual Bandits Aug 23, 2024 Fairness Multi-Armed Bandits
— Unverified 0Dynamic Product Image Generation and Recommendation at Scale for Personalized E-commerce Aug 22, 2024 Image Generation Multi-Armed Bandits
— Unverified 0Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Aug 22, 2024 Language Modeling Language Modelling
— Unverified 0Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities Aug 20, 2024 Multi-Armed Bandits
— Unverified 0Contextual Bandits for Unbounded Context Distributions Aug 19, 2024 Decision Making Multi-Armed Bandits
— Unverified 0GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits Aug 19, 2024 Multi-Armed Bandits Q-Learning
— Unverified 0Reciprocal Learning Aug 12, 2024 Active Learning Multi-Armed Bandits
— Unverified 0Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels Aug 10, 2024 Knowledge Tracing Multi-Armed Bandits
Code Code Available 0Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits Aug 8, 2024 Exposure Fairness Fairness
Code Code Available 0Combining Diverse Information for Coordinated Action: Stochastic Bandit Algorithms for Heterogeneous Agents Aug 6, 2024 Multi-Armed Bandits Sensitivity
Code Code Available 0Empathic Responding for Digital Interpersonal Emotion Regulation via Content Recommendation Aug 5, 2024 Multi-Armed Bandits
— Unverified 0Online Learning for Autonomous Management of Intent-based 6G Networks Jul 25, 2024 Efficient Exploration Management
— Unverified 0Identifiable latent bandits: Combining observational data and exploration for personalized healthcare Jul 23, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Scalable Exploration via Ensemble++ Jul 18, 2024 Computational Efficiency Decision Making
Code Code Available 0Satisficing Exploration for Deep Reinforcement Learning Jul 16, 2024 Deep Reinforcement Learning Multi-Armed Bandits
— Unverified 0Open Problem: Tight Bounds for Kernelized Multi-Armed Bandits with Bernoulli Rewards Jul 8, 2024 Multi-Armed Bandits
— Unverified 0On Speeding Up Language Model Evaluation Jul 8, 2024 Language Model Evaluation Language Modeling
— Unverified 0Honor Among Bandits: No-Regret Learning for Online Fair Division Jul 1, 2024 Fairness Multi-Armed Bandits
— Unverified 0A Contextual Combinatorial Bandit Approach to Negotiation Jun 30, 2024 Multi-Armed Bandits
— Unverified 0Classical Bandit Algorithms for Entanglement Detection in Parameterized Qubit States Jun 28, 2024 Multi-Armed Bandits
— Unverified 0Jump Starting Bandits with LLM-Generated Prior Knowledge Jun 27, 2024 Multi-Armed Bandits Recommendation Systems
Code Code Available 0EduQate: Generating Adaptive Curricula through RMABs in Education Settings Jun 20, 2024 Multi-Armed Bandits Q-Learning
— Unverified 0BEACON: Balancing Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes Jun 19, 2024 Multi-Armed Bandits Nutrition
— Unverified 0Towards Bayesian Data Selection Jun 18, 2024 Active Learning Additive models
— Unverified 0Discovering Minimal Reinforcement Learning Environments Jun 18, 2024 continuous-control Continuous Control
Code Code Available 1Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions Jun 16, 2024 Multi-Armed Bandits Policy Gradient Methods
— Unverified 0An Adaptive Method for Contextual Stochastic Multi-armed Bandits with Rewards Generated by a Linear Dynamical System Jun 14, 2024 Multi-Armed Bandits
— Unverified 0Linear Contextual Bandits with Hybrid Payoff: Revisited Jun 14, 2024 Diversity Multi-Armed Bandits
Code Code Available 0Towards Domain Adaptive Neural Contextual Bandits Jun 13, 2024 Decision Making Domain Adaptation
— Unverified 0A Federated Online Restless Bandit Framework for Cooperative Resource Allocation Jun 12, 2024 Federated Learning Multi-Armed Bandits
— Unverified 0Asymptotically Optimal Regret for Black-Box Predict-then-Optimize Jun 12, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning Jun 11, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0A conversion theorem and minimax optimality for continuum contextual bandits Jun 9, 2024 Multi-Armed Bandits
— Unverified 0Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits Jun 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Adaptively Learning to Select-Rank in Online Platforms Jun 7, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0