Graph-Dependent Regret Bounds in Multi-Armed Bandits with Interference Mar 10, 2025 Multi-Armed Bandits Sequential Decision Making
— Unverified 0Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Mar 6, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits Mar 4, 2025 Multi-Armed Bandits
— Unverified 0Towards Understanding the Benefit of Multitask Representation Learning in Decision Process Mar 1, 2025 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits Mar 1, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Semi-Parametric Batched Global Multi-Armed Bandits with Covariates Mar 1, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Functional multi-armed bandit and the best function identification problems Mar 1, 2025 Multi-Armed Bandits
— Unverified 0Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models Feb 27, 2025 Mathematical Reasoning Multi-Armed Bandits
— Unverified 0Transfer Learning in Latent Contextual Bandits with Covariate Shift Through Causal Transportability Feb 27, 2025 Causal Inference Multi-Armed Bandits
Code Code Available 0Heterogeneous Multi-Agent Bandits with Parsimonious Hints Feb 22, 2025 4k Multi-Armed Bandits
— Unverified 0Multi-agent Multi-armed Bandits with Minimum Reward Guarantee Fairness Feb 21, 2025 Fairness Multi-Armed Bandits
Code Code Available 0Achieving adaptivity and optimality for multi-armed bandits using Exponential-Kullback Leibler Maillard Sampling Feb 20, 2025 Multi-Armed Bandits Thompson Sampling
— Unverified 0Efficient and Optimal Policy Gradient Algorithm for Corrupted Multi-armed Bandits Feb 19, 2025 Multi-Armed Bandits
— Unverified 0Continuous K-Max Bandits Feb 19, 2025 Distributed Computing Multi-Armed Bandits
— Unverified 0Contextual Linear Bandits with Delay as Payoff Feb 18, 2025 Multi-Armed Bandits
— Unverified 0Model selection for behavioral learning data and applications to contextual bandits Feb 18, 2025 Model Selection Multi-Armed Bandits
— Unverified 0Near-Optimal Private Learning in Linear Contextual Bandits Feb 18, 2025 Multi-Armed Bandits
— Unverified 0Improved Offline Contextual Bandits with Second-Order Bounds: Betting and Freezing Feb 15, 2025 Multi-Armed Bandits
— Unverified 0Contextual bandits with entropy-based human feedback Feb 12, 2025 Multi-Armed Bandits
Code Code Available 0Heterogeneous Multi-agent Multi-armed Bandits on Stochastic Block Models Feb 11, 2025 Multi-Armed Bandits Stochastic Block Model
— Unverified 0Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits Feb 11, 2025 Computational Efficiency Multi-Armed Bandits
— Unverified 0Quantile Multi-Armed Bandits with 1-bit Feedback Feb 10, 2025 Multi-Armed Bandits
— Unverified 0Towards a Sharp Analysis of Offline Policy Learning for f-Divergence-Regularized Contextual Bandits Feb 9, 2025 Multi-Armed Bandits
— Unverified 0From Restless to Contextual: A Thresholding Bandit Approach to Improve Finite-horizon Performance Feb 7, 2025 Multi-Armed Bandits
Code Code Available 0Nearly Tight Bounds for Cross-Learning Contextual Bandits with Graphical Feedback Feb 7, 2025 Multi-Armed Bandits
— Unverified 0Early Stopping in Contextual Bandits and Inferences Feb 5, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Catoni Contextual Bandits are Robust to Heavy-tailed Rewards Feb 4, 2025 Multi-Armed Bandits
— Unverified 0Optimizing Online Advertising with Multi-Armed Bandits: Mitigating the Cold Start Problem under Auction Dynamics Feb 3, 2025 Multi-Armed Bandits
— Unverified 0Nearly Tight Bounds for Exploration in Streaming Multi-armed Bandits with Known Optimality Gap Feb 3, 2025 Multi-Armed Bandits
— Unverified 0Meta-Prompt Optimization for LLM-Based Sequential Decision Making Feb 2, 2025 Bayesian Optimization Decision Making
— Unverified 0Nearly-Optimal Bandit Learning in Stackelberg Games with Side Information Jan 31, 2025 Multi-Armed Bandits
— Unverified 0Offline Learning for Combinatorial Multi-armed Bandits Jan 31, 2025 Decision Making Language Modeling
— Unverified 0Solving Inverse Problem for Multi-armed Bandits via Convex Optimization Jan 31, 2025 Multi-Armed Bandits
Code Code Available 0Multi-agent Multi-armed Bandit with Fully Heavy-tailed Dynamics Jan 31, 2025 Multi-Armed Bandits
— Unverified 0Contextual Online Decision Making with Infinite-Dimensional Functional Regression Jan 30, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Breaking the (1/Δ_2) Barrier: Better Batched Best Arm Identification with Adaptive Grids Jan 29, 2025 Multi-Armed Bandits
— Unverified 0Sequential Learning of the Pareto Front for Multi-objective Bandits Jan 29, 2025 Multi-Armed Bandits
Code Code Available 0HD-CB: The First Exploration of Hyperdimensional Computing for Contextual Bandits Problems Jan 28, 2025 Computational Efficiency Multi-Armed Bandits
— Unverified 0Restless Multi-armed Bandits under Frequency and Window Constraints for Public Service Inspections Jan 27, 2025 Multi-Armed Bandits Scheduling
— Unverified 0Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy Jan 24, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Optimal Multi-Objective Best Arm Identification with Fixed Confidence Jan 23, 2025 Multi-Armed Bandits
— Unverified 0Efficient Implementation of LinearUCB through Algorithmic Improvements and Vector Computing Acceleration for Embedded Learning Systems Jan 22, 2025 Decision Making Edge-computing
— Unverified 0Heterogeneous Multi-Player Multi-Armed Bandits Robust To Adversarial Attacks Jan 21, 2025 Adversarial Attack All
— Unverified 0Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness Jan 20, 2025 Fairness Multi-Armed Bandits
— Unverified 0Pairwise Elimination with Instance-Dependent Guarantees for Bandits with Cost Subsidy Jan 17, 2025 Multi-Armed Bandits
— Unverified 0Neural Risk-sensitive Satisficing in Contextual Bandits Jan 15, 2025 Multi-Armed Bandits Recommendation Systems
— Unverified 0Differentially Private Kernelized Contextual Bandits Jan 13, 2025 Multi-Armed Bandits
— Unverified 0On The Statistical Complexity of Offline Decision-Making Jan 10, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Finite-Horizon Single-Pull Restless Bandits: An Efficient Index Policy For Scarce Resource Allocation Jan 10, 2025 Multi-Armed Bandits
— Unverified 0An Instrumental Value for Data Production and its Application to Data Pricing Dec 24, 2024 Decision Making Multi-Armed Bandits
— Unverified 0