Contextual Linear Bandits with Delay as Payoff | Feb 18, 2025 | Multi-Armed Bandits | Unverified
Model selection for behavioral learning data and applications to contextual bandits | Feb 18, 2025 | Model Selection, Multi-Armed Bandits | Unverified
Near-Optimal Private Learning in Linear Contextual Bandits | Feb 18, 2025 | Multi-Armed Bandits | Unverified
Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing | Feb 15, 2025 | Multi-Armed Bandits | Unverified
Contextual bandits with entropy-based human feedback | Feb 12, 2025 | Multi-Armed Bandits | Code Available
Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits | Feb 11, 2025 | Computational Efficiency, Multi-Armed Bandits | Unverified
Heterogeneous Multi-agent Multi-armed Bandits on Stochastic Block Models | Feb 11, 2025 | Multi-Armed Bandits, Stochastic Block Model | Unverified
Quantile Multi-Armed Bandits with 1-bit Feedback | Feb 10, 2025 | Multi-Armed Bandits | Unverified
Towards a Sharp Analysis of Offline Policy Learning for f-Divergence-Regularized Contextual Bandits | Feb 9, 2025 | Multi-Armed Bandits | Unverified
From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance | Feb 7, 2025 | Multi-Armed Bandits | Code Available
Nearly Tight Bounds for Cross-Learning Contextual Bandits with Graphical Feedback | Feb 7, 2025 | Multi-Armed Bandits | Unverified
Early Stopping in Contextual Bandits and Inferences | Feb 5, 2025 | Decision Making, Multi-Armed Bandits | Unverified
Catoni Contextual Bandits are Robust to Heavy-tailed Rewards | Feb 4, 2025 | Multi-Armed Bandits | Unverified
Nearly Tight Bounds for Exploration in Streaming Multi-armed Bandits with Known Optimality Gap | Feb 3, 2025 | Multi-Armed Bandits | Unverified
Optimizing Online Advertising with Multi-Armed Bandits: Mitigating the Cold Start Problem under Auction Dynamics | Feb 3, 2025 | Multi-Armed Bandits | Unverified
Meta-Prompt Optimization for LLM-Based Sequential Decision Making | Feb 2, 2025 | Bayesian Optimization, Decision Making | Unverified
Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision Making, Language Modeling | Unverified
Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics | Jan 31, 2025 | Multi-Armed Bandits | Unverified
Solving Inverse Problem for Multi-armed Bandits via Convex Optimization | Jan 31, 2025 | Multi-Armed Bandits | Code Available
Nearly-Optimal Bandit Learning in Stackelberg Games with Side Information | Jan 31, 2025 | Multi-Armed Bandits | Unverified
Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision Making, Multi-Armed Bandits | Unverified
Breaking the (1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids | Jan 29, 2025 | Multi-Armed Bandits | Unverified
Sequential Learning of the Pareto Front for Multi-objective Bandits | Jan 29, 2025 | Multi-Armed Bandits | Code Available
HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems | Jan 28, 2025 | Computational Efficiency, Multi-Armed Bandits | Unverified
Restless Multi-armed Bandits under Frequency and Window Constraints for Public Service Inspections | Jan 27, 2025 | Multi-Armed Bandits, Scheduling | Unverified
Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy | Jan 24, 2025 | Decision Making, Multi-Armed Bandits | Unverified
Optimal Multi-Objective Best Arm Identification with Fixed Confidence | Jan 23, 2025 | Multi-Armed Bandits | Unverified
Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems | Jan 22, 2025 | Decision Making, Edge-computing | Unverified
Heterogeneous Multi-Player Multi-Armed Bandits Robust To Adversarial Attacks | Jan 21, 2025 | Adversarial Attack, All | Unverified
Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness | Jan 20, 2025 | Fairness, Multi-Armed Bandits | Unverified
Pairwise Elimination with Instance-Dependent Guarantees for Bandits with Cost Subsidy | Jan 17, 2025 | Multi-Armed Bandits | Unverified
Neural Risk-sensitive Satisficing in Contextual Bandits | Jan 15, 2025 | Multi-Armed Bandits, Recommendation Systems | Unverified
Differentially Private Kernelized Contextual Bandits | Jan 13, 2025 | Multi-Armed Bandits | Unverified
Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation | Jan 10, 2025 | Multi-Armed Bandits | Unverified
On The Statistical Complexity of Offline Decision-Making | Jan 10, 2025 | Decision Making, Multi-Armed Bandits | Unverified
An Instrumental Value for Data Production and its Application to Data Pricing | Dec 24, 2024 | Decision Making, Multi-Armed Bandits | Unverified
A Novel Approach to Balance Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes and its Implementation in BEACON | Dec 23, 2024 | Multi-Armed Bandits, Nutrition | Unverified
Lagrangian Index Policy for Restless Bandits with Average Reward | Dec 17, 2024 | Multi-Armed Bandits, reinforcement-learning | Unverified
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization | Dec 16, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL) | Unverified
An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints | Dec 11, 2024 | Multi-Armed Bandits | Unverified
IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health | Dec 11, 2024 | Multi-Armed Bandits | Code Available
UCB algorithms for multi-armed bandits: Precise regret and adaptive inference | Dec 9, 2024 | Multi-Armed Bandits | Unverified
Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed Bandits, Sequential Decision Making | Unverified
Coordinated Multi-Armed Bandits for Improved Spatial Reuse in Wi-Fi | Dec 4, 2024 | Decision Making, Fairness | Unverified
Data Acquisition for Improving Model Fairness using Reinforcement Learning | Dec 4, 2024 | Data Valuation, Fairness | Unverified
Selective Reviews of Bandit Problems in AI via a Statistical View | Dec 3, 2024 | Decision Making, Decision Making Under Uncertainty | Unverified
Achieving PAC Guarantees in Mechanism Design through Multi-Armed Bandits | Nov 30, 2024 | Multi-Armed Bandits | Unverified
Contextual Bandits in Payment Processing: Non-uniform Exploration and Supervised Learning at Adyen | Nov 30, 2024 | Multi-Armed Bandits, regression | Unverified
Off-policy estimation with adaptively collected data: the power of online learning | Nov 19, 2024 | Causal Inference, Multi-Armed Bandits | Unverified
Multi-Agent Stochastic Bandits Robust to Adversarial Corruptions | Nov 12, 2024 | Multi-Armed Bandits | Unverified