Offline Clustering of Linear Bandits: Unlocking the Power of Clusters in Data-Limited Environments | May 25, 2025 | Clustering Multi-Armed Bandits | — | Unverified | 0
Test-Time Scaling of Diffusion Models via Noise Trajectory Search | May 24, 2025 | Denoising Image Generation | Code | Code Available | 0
KL-regularization Itself is Differentially Private in Bandits and RLHF | May 23, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype | May 22, 2025 | Feature Engineering Large Language Model | — | Unverified | 0
Optimal Best-Arm Identification under Fixed Confidence with Multiple Optima | May 21, 2025 | Multi-Armed Bandits | — | Unverified | 0
In-Domain African Languages Translation Using LLMs and Multi-armed Bandits | May 21, 2025 | Domain Adaptation Machine Translation | — | Unverified | 0
Human in the Loop Adaptive Optimization for Improved Time Series Forecasting | May 21, 2025 | Language Modeling Language Modelling | Code | Code Available | 0
High-dimensional Nonparametric Contextual Bandit Problem | May 20, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis | May 19, 2025 | All Multi-Armed Bandits | — | Unverified | 0
Multi-Armed Bandits Meet Large Language Models | May 19, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
Near Optimal Best Arm Identification for Clustered Bandits | May 15, 2025 | Clustering Computational Efficiency | — | Unverified | 0
Batched Nonparametric Bandits via k-Nearest Neighbor UCB | May 15, 2025 | Decision Making Marketing | — | Unverified | 0
Adaptive, Robust and Scalable Bayesian Filtering for Online Learning | May 12, 2025 | Continual Learning Multi-Armed Bandits | — | Unverified | 0
Navigating the Rashomon Effect: How Personalization Can Help Adjust Interpretable Machine Learning Models to Individual Users | May 11, 2025 | Additive models Interpretable Machine Learning | — | Unverified | 0
Adaptive Budgeted Multi-Armed Bandits for IoT with Dynamic Resource Constraints | May 5, 2025 | Multi-Armed Bandits | — | Unverified | 0
Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms | Apr 29, 2025 | Multi-Armed Bandits Navigate | — | Unverified | 0
Access Probability Optimization in RACH: A Multi-Armed Bandits Approach | Apr 18, 2025 | Multi-Armed Bandits | — | Unverified | 0
On the Problem of Best Arm Retention | Apr 16, 2025 | Multi-Armed Bandits | — | Unverified | 0
Neural Contextual Bandits Under Delayed Feedback Constraints | Apr 16, 2025 | Multi-Armed Bandits Recommendation Systems | — | Unverified | 0
Learning-Based User Association for MmWave Vehicular Networks With Kernelized Contextual Bandits | Apr 15, 2025 | Multi-Armed Bandits | — | Unverified | 0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision Making Decision Making Under Uncertainty | — | Unverified | 0
A Classification View on Meta Learning Bandits | Apr 6, 2025 | Classification Meta-Learning | — | Unverified | 0
An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System | Apr 4, 2025 | Hyperparameter Optimization Multi-Armed Bandits | — | Unverified | 0
Antithetic Sampling for Top-k Shapley Identification | Apr 2, 2025 | Multi-Armed Bandits | Code | Code Available | 0
Efficient Near-Optimal Algorithm for Online Shortest Paths in Directed Acyclic Graphs with Bandit Feedback Against Adaptive Adversaries | Apr 1, 2025 | Multi-Armed Bandits | — | Unverified | 0
Reinforcement Learning for Machine Learning Model Deployment: Evaluating Multi-Armed Bandits in ML Ops Environments | Mar 28, 2025 | Management Model Selection | — | Unverified | 0
MultiScale Contextual Bandits for Long Term Objectives | Mar 22, 2025 | Multi-Armed Bandits Recommendation Systems | — | Unverified | 0
Sparse Additive Contextual Bandits: A Nonparametric Approach for Online Decision-making with High-dimensional Covariates | Mar 21, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction | Mar 20, 2025 | Conformal Prediction Decision Making | Code | Code Available | 0
Sparse Nonparametric Contextual Bandits | Mar 20, 2025 | Multi-Armed Bandits Thompson Sampling | — | Unverified | 0
A New Benchmark for Online Learning with Budget-Balancing Constraints | Mar 19, 2025 | Multi-Armed Bandits | — | Unverified | 0
Variance-Dependent Regret Lower Bounds for Contextual Bandits | Mar 15, 2025 | Multi-Armed Bandits | — | Unverified | 0
Bi-Criteria Optimization for Combinatorial Bandits: Sublinear Regret and Constraint Violation under Bandit Feedback | Mar 15, 2025 | Multi-Armed Bandits | — | Unverified | 0
Locally Private Nonparametric Contextual Multi-armed Bandits | Mar 11, 2025 | Decision Making Multi-Armed Bandits | Code | Code Available | 0
Multiplayer Information Asymmetric Contextual Bandits | Mar 11, 2025 | Multi-Armed Bandits | — | Unverified | 0
Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference | Mar 10, 2025 | Multi-Armed Bandits Sequential Decision Making | — | Unverified | 0
Cost-Aware Optimal Pairwise Pure Exploration | Mar 10, 2025 | Multi-Armed Bandits | — | Unverified | 0
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure | Mar 6, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits | Mar 4, 2025 | Multi-Armed Bandits | — | Unverified | 0
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits | Mar 1, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
Towards Understanding the Benefit of Multitask Representation Learning in Decision Process | Mar 1, 2025 | Multi-Armed Bandits Reinforcement Learning (RL) | — | Unverified | 0
Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision Making Multi-Armed Bandits | — | Unverified | 0
Functional multi-armed bandit and the best function identification problems | Mar 1, 2025 | Multi-Armed Bandits | — | Unverified | 0
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models | Feb 27, 2025 | Mathematical Reasoning Multi-Armed Bandits | — | Unverified | 0
Transfer Learning in Latent Contextual Bandits with Covariate Shift Through Causal Transportability | Feb 27, 2025 | Causal Inference Multi-Armed Bandits | Code | Code Available | 0
Heterogeneous Multi-Agent Bandits with Parsimonious Hints | Feb 22, 2025 | 4k Multi-Armed Bandits | — | Unverified | 0
Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness | Feb 21, 2025 | Fairness Multi-Armed Bandits | Code | Code Available | 0
Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling | Feb 20, 2025 | Multi-Armed Bandits Thompson Sampling | — | Unverified | 0
Continuous K-Max Bandits | Feb 19, 2025 | Distributed Computing Multi-Armed Bandits | — | Unverified | 0
Efficient and Optimal Policy Gradient Algorithm for Corrupted Multi-armed Bandits | Feb 19, 2025 | Multi-Armed Bandits | — | Unverified | 0