Optimal Batched Linear Bandits | Jun 6, 2024 | Computational Efficiency Multi-Armed Bandits | Code Available | 0
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond | Jun 3, 2024 | Multi-Armed Bandits Reinforcement Learning (RL) | Unverified | 0
Global Rewards in Restless Multi-Armed Bandits | Jun 2, 2024 | Multi-Armed Bandits | Unverified | 0
Strategic Linear Contextual Bandits | Jun 1, 2024 | Multi-Armed Bandits Recommendation Systems | Unverified | 0
A Batch Sequential Halving Algorithm without Performance Degradation | Jun 1, 2024 | Computational Efficiency Multi-Armed Bandits | Unverified | 0
No-Regret Learning for Fair Multi-Agent Social Welfare Optimization | May 31, 2024 | Fairness Multi-Armed Bandits | Unverified | 0
Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits | May 30, 2024 | Multi-Armed Bandits | Unverified | 0
Optimizing Sharpe Ratio: Risk-Adjusted Decision-Making in Multi-Armed Bandits | May 28, 2024 | Decision Making Management | Unverified | 0
Causal Contextual Bandits with Adaptive Context | May 28, 2024 | Multi-Armed Bandits | Code Available | 0
Multi-Armed Bandits with Network Interference | May 28, 2024 | Multi-Armed Bandits | Code Available | 0
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff | May 28, 2024 | Density Estimation Multi-Armed Bandits | Unverified | 0
Multi-Player Approaches for Dueling Bandits | May 25, 2024 | Multi-Armed Bandits | Unverified | 0
Indexed Minimum Empirical Divergence-Based Algorithms for Linear Bandits | May 24, 2024 | Multi-Armed Bandits Thompson Sampling | Unverified | 0
Budgeted Recommendation with Delayed Feedback | May 19, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization | May 10, 2024 | Multi-Armed Bandits | Unverified | 0
Federated Combinatorial Multi-Agent Multi-Armed Bandits | May 9, 2024 | Combinatorial Optimization Data Summarization | Unverified | 0
Optimal Baseline Corrections for Off-Policy Contextual Bandits | May 9, 2024 | Decision Making Multi-Armed Bandits | Code Available | 0
Imprecise Multi-Armed Bandits | May 9, 2024 | Multi-Armed Bandits | Unverified | 0
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data | May 4, 2024 | Multi-Armed Bandits | Unverified | 0
Mathematics of statistical sequential decision-making: concentration, risk-awareness and modelling in stochastic bandits, with applications to bariatric surgery | May 3, 2024 | Decision Making Interpretable Machine Learning | Unverified | 0
Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | May 2, 2024 | Multi-Armed Bandits Sequential Decision Making | Unverified | 0
Recommenadation aided Caching using Combinatorial Multi-armed Bandits | Apr 30, 2024 | Multi-Armed Bandits | Unverified | 0
Disentangling Exploration from Exploitation | Apr 29, 2024 | Disentanglement Multi-Armed Bandits | Unverified | 0
Causally Abstracted Multi-armed Bandits | Apr 26, 2024 | Decision Making Multi-Armed Bandits | Code Available | 0
Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks | Apr 25, 2024 | Fairness Multi-Armed Bandits | Unverified | 0
Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Apr 10, 2024 | Decision Making Meta Reinforcement Learning | Code Available | 0
Generalized Linear Bandits with Limited Adaptivity | Apr 10, 2024 | Multi-Armed Bandits | Code Available | 0
Feel-Good Thompson Sampling for Contextual Dueling Bandits | Apr 9, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
Hypothesis Generation with Large Language Models | Apr 5, 2024 | Multi-Armed Bandits | Code Available | 2
On the Importance of Uncertainty in Decision-Making with Large Language Models | Apr 3, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
Doubly-Robust Off-Policy Evaluation with Estimated Logging Policy | Apr 2, 2024 | Multi-Armed Bandits Off-policy evaluation | Unverified | 0
Nearly-tight Approximation Guarantees for the Improving Multi-Armed Bandits Problem | Apr 1, 2024 | Multi-Armed Bandits | Unverified | 0
A Correction of Pseudo Log-Likelihood Method | Mar 26, 2024 | Multi-Armed Bandits | Unverified | 0
Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making | Mar 22, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
Transfer in Sequential Multi-armed Bandits via Reward Samples | Mar 19, 2024 | Multi-Armed Bandits | Unverified | 0
Phasic Diversity Optimization for Population-Based Reinforcement Learning | Mar 17, 2024 | Diversity MuJoCo | Unverified | 0
Cramming Contextual Bandits for On-policy Statistical Evaluation | Mar 11, 2024 | Multi-Armed Bandits Off-policy evaluation | Unverified | 0
ε-Neural Thompson Sampling of Deep Brain Stimulation for Parkinson Disease Treatment | Mar 11, 2024 | Multi-Armed Bandits Reinforcement Learning (RL) | Unverified | 0
Efficient Public Health Intervention Planning Using Decomposition-Based Decision-Focused Learning | Mar 8, 2024 | Multi-Armed Bandits | Unverified | 0
A General Reduction for High-Probability Analysis with General Light-Tailed Distributions | Mar 5, 2024 | Multi-Armed Bandits Stochastic Optimization | Unverified | 0
LC-Tsallis-INF: Generalized Best-of-Both-Worlds Linear Contextual Bandits | Mar 5, 2024 | Multi-Armed Bandits | Unverified | 0
Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | Mar 1, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
Federated Linear Contextual Bandits with Heterogeneous Clients | Feb 29, 2024 | All Federated Learning | Unverified | 0
Investigating Gender Fairness in Machine Learning-driven Personalized Care for Chronic Pain | Feb 29, 2024 | Decision Making Fairness | Unverified | 0
Batched Nonparametric Contextual Bandits | Feb 27, 2024 | Multi-Armed Bandits | Unverified | 0
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement | Feb 24, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery | Feb 24, 2024 | Multi-Armed Bandits | Code Available | 0
Multi-Armed Bandits with Abstention | Feb 23, 2024 | Decision Making Multi-Armed Bandits | Unverified | 0
Optimistic Information Directed Sampling | Feb 23, 2024 | Multi-Armed Bandits | Unverified | 0
A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Feb 22, 2024 | Language Modeling Language Modelling | Unverified | 0