Title | Date | Topics | Status
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling | Oct 11, 2023 | Multi-Armed Bandits | Unverified
Ensemble Active Learning by Contextual Bandits for AI Incubation in Manufacturing | Oct 10, 2023 | Active Learning, Decision Making | Unverified
Adversarial Attacks on Combinatorial Multi-Armed Bandits | Oct 8, 2023 | Multi-Armed Bandits | Code Available
Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Oct 3, 2023 | Multi-Armed Bandits, Q-Learning | Unverified
Improved Algorithms for Adversarial Bandits with Unbounded Losses | Oct 3, 2023 | Multi-Armed Bandits | Unverified
Adversarial Contextual Bandits Go Kernelized | Oct 2, 2023 | Decision Making, Multi-Armed Bandits | Unverified
Discrete Choice Multi-Armed Bandits | Oct 1, 2023 | Discrete Choice Models, Multi-Armed Bandits | Unverified
Bayesian Design Principles for Frequentist Sequential Learning | Oct 1, 2023 | Multi-Armed Bandits, reinforcement-learning | Code Available
Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts | Sep 25, 2023 | LEMMA, Multi-Armed Bandits | Unverified
Diversify and Conquer: Bandits and Diversity for an Enhanced E-commerce Homepage Experience | Sep 25, 2023 | Diversity, Multi-Armed Bandits | Unverified
A Convex Framework for Confounding Robust Inference | Sep 21, 2023 | Model Selection, Multi-Armed Bandits | Code Available
Task Selection and Assignment for Multi-modal Multi-task Dialogue Act Classification with Non-stationary Multi-armed Bandits | Sep 18, 2023 | Dialogue Act Classification, Multi-Armed Bandits | Unverified
Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits | Sep 15, 2023 | Multi-Armed Bandits, Off-policy evaluation | Unverified
Doubly High-Dimensional Contextual Bandits: An Interpretable Model for Joint Assortment-Pricing | Sep 14, 2023 | Multi-Armed Bandits | Unverified
The Best Arm Evades: Near-optimal Multi-pass Streaming Lower Bounds for Pure Exploration in Multi-armed Bandits | Sep 6, 2023 | Multi-Armed Bandits | Unverified
Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits | Sep 2, 2023 | Computational Efficiency, Multi-Armed Bandits | Unverified
Concentrated Differential Privacy for Bandits | Sep 1, 2023 | Multi-Armed Bandits, Recommendation Systems | Unverified
Pure Exploration under Mediators' Feedback | Aug 29, 2023 | Decision Making, Multi-Armed Bandits | Unverified
Stochastic Graph Bandit Learning with Side-Observations | Aug 29, 2023 | Computational Efficiency, Multi-Armed Bandits | Unverified
Learning How to Price Charging in Electric Ride-Hailing Markets | Aug 25, 2023 | Multi-Armed Bandits | Unverified
Master-slave Deep Architecture for Top-K Multi-armed Bandits with Non-linear Bandit Feedback and Diversity Constraints | Aug 24, 2023 | Diversity, Multi-Armed Bandits | Code Available
On Universally Optimal Algorithms for A/B Testing | Aug 23, 2023 | Multi-Armed Bandits | Unverified
Clustered Linear Contextual Bandits with Knapsacks | Aug 21, 2023 | Econometrics, Multi-Armed Bandits | Unverified
Graph Neural Bandits | Aug 21, 2023 | Multi-Armed Bandits | Unverified
Cost-Efficient Online Decision Making: A Combinatorial Multi-Armed Bandit Approach | Aug 21, 2023 | Decision Making, Multi-Armed Bandits | Code Available
AdaptEx: A Self-Service Contextual Bandit Platform | Aug 8, 2023 | Multi-Armed Bandits, Thompson Sampling | Unverified
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs | Aug 8, 2023 | Multi-Armed Bandits | Unverified
Transfer Learning with Partially Observable Offline Data via Causal Bounds | Aug 7, 2023 | Multi-Armed Bandits, Transfer Learning | Unverified
Online Matching: A Real-time Bandit System for Large-scale Recommendations | Jul 29, 2023 | Multi-Armed Bandits, Recommendation Systems | Code Available
Provable Benefits of Policy Learning from Human Preferences in Contextual Bandit Problems | Jul 24, 2023 | Decision Making, Multi-Armed Bandits | Unverified
Contextual Bandits and Imitation Learning via Preference-Based Active Queries | Jul 24, 2023 | Imitation Learning, Multi-Armed Bandits | Unverified
Preferences Evolve And So Should Your Bandits: Bandits with Evolving States for Online Platforms | Jul 21, 2023 | Multi-Armed Bandits, Recommendation Systems | Unverified
Decentralized Smart Charging of Large-Scale EVs using Adaptive Multi-Agent Multi-Armed Bandits | Jul 20, 2023 | Fairness, Multi-Armed Bandits | Unverified
VITS: Variational Inference Thompson Sampling for contextual bandits | Jul 19, 2023 | Multi-Armed Bandits, Thompson Sampling | Code Available
On Interpolating Experts and Multi-Armed Bandits | Jul 14, 2023 | Multi-Armed Bandits | Unverified
Adaptive Linear Estimating Equations | Jul 14, 2023 | Multi-Armed Bandits | Code Available
Tracking Most Significant Shifts in Nonparametric Contextual Bandits | Jul 11, 2023 | Multi-Armed Bandits | Unverified
SHAP@k: Efficient and Probably Approximately Correct (PAC) Identification of Top-k Features | Jul 10, 2023 | Feature Importance, Multi-Armed Bandits | Unverified
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Jul 7, 2023 | Decision Making, Multi-Armed Bandits | Unverified
Meta-Learning Adversarial Bandit Algorithms | Jul 5, 2023 | Meta-Learning, Multi-Armed Bandits | Unverified
Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization | Jul 5, 2023 | Multi-Armed Bandits | Unverified
Thompson sampling for improved exploration in GFlowNets | Jun 30, 2023 | Active Learning, Decision Making | Unverified
Kernel ε-Greedy for Multi-Armed Bandits with Covariates | Jun 29, 2023 | Multi-Armed Bandits | Unverified
Pure exploration in multi-armed bandits with low rank structure using oblivious sampler | Jun 28, 2023 | Multi-Armed Bandits | Unverified
You Can Trade Your Experience in Distributed Multi-Agent Multi-Armed Bandits | Jun 19, 2023 | Decision Making, Multi-Armed Bandits | Unverified
Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Jun 15, 2023 | Decision Making, Multi-Armed Bandits | Unverified
Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits | Jun 13, 2023 | Multi-Armed Bandits | Unverified
Multi-Fidelity Multi-Armed Bandits Revisited | Jun 13, 2023 | Multi-Armed Bandits | Unverified
Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals | Jun 12, 2023 | Multi-Armed Bandits | Code Available
Optimal Multitask Linear Regression and Contextual Bandits under Sparse Heterogeneity | Jun 9, 2023 | Multi-Armed Bandits, regression | Unverified