Nonparametric Contextual Bandits in Metric Spaces with Unknown Metric | Dec 1, 2019 | Multi-Armed Bandits
Nonparametric Stochastic Contextual Bandits | Jan 5, 2018 | General Classification, image-classification
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling | Oct 11, 2023 | Multi-Armed Bandits
Adversarial Rewards in Universal Learning for Contextual Bandits | Feb 14, 2023 | Multi-Armed Bandits
Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset | Nov 6, 2024 | Continual Learning, Multi-Armed Bandits
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach | Feb 10, 2021 | Multi-Armed Bandits, reinforcement-learning
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback | Sep 30, 2014 | Multi-Armed Bandits
Non-Stochastic Multi-Player Multi-Armed Bandits: Optimal Rate With Collision Information, Sublinear Without | Apr 28, 2019 | Multi-Armed Bandits
No-Regret is not enough! Bandits with General Constraints through Adaptive Regret Minimization | May 10, 2024 | Multi-Armed Bandits
No-Regret Learning for Fair Multi-Agent Social Welfare Optimization | May 31, 2024 | Fairness, Multi-Armed Bandits
Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration | Dec 19, 2023 | Bayesian Inference, Decision Making
Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search | Jan 21, 2022 | Multi-Armed Bandits, Reinforcement Learning (RL)
Offline Clustering of Linear Bandits: Unlocking the Power of Clusters in Data-Limited Environments | May 25, 2025 | Clustering, Multi-Armed Bandits
Offline Contextual Bandits for Wireless Network Optimization | Nov 11, 2021 | Computational Efficiency, Multi-Armed Bandits
Offline Contextual Multi-armed Bandits for Mobile Health Interventions: A Case Study on Emotion Regulation | Aug 21, 2020 | Management, Multi-Armed Bandits
Offline Learning for Combinatorial Multi-armed Bandits | Jan 31, 2025 | Decision Making, Language Modeling
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff | May 28, 2024 | Density Estimation, Multi-Armed Bandits
Off-policy estimation with adaptively collected data: the power of online learning | Nov 19, 2024 | Causal Inference, Multi-Armed Bandits
Off-Policy Evaluation for Large Action Spaces via Policy Convolution | Oct 24, 2023 | Multi-Armed Bandits, Off-policy evaluation
Off-Policy Risk Assessment in Contextual Bandits | Apr 18, 2021 | Multi-Armed Bandits, Off-policy evaluation
Off-Policy Risk Assessment in Markov Decision Processes | Sep 21, 2022 | Multi-Armed Bandits, Safety Alignment
On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits | Sep 30, 2022 | Multi-Armed Bandits
On conditional versus marginal bias in multi-armed bandits | Feb 19, 2020 | Multi-Armed Bandits
On Differentially Private Federated Linear Contextual Bandits | Feb 27, 2023 | Multi-Armed Bandits
On Finding the Largest Mean Among Many | Jun 17, 2013 | Multi-Armed Bandits
On Interpolating Experts and Multi-Armed Bandits | Jul 14, 2023 | Multi-Armed Bandits
On Kernelized Multi-armed Bandits | Apr 3, 2017 | Multi-Armed Bandits
On Kernelized Multi-Armed Bandits with Constraints | Mar 29, 2022 | Multi-Armed Bandits, Thompson Sampling
On Lai's Upper Confidence Bound in Multi-Armed Bandits | Oct 3, 2024 | Multi-Armed Bandits
On Learning to Rank Long Sequences with Contextual Bandits | Jun 7, 2021 | Learning-To-Rank, Multi-Armed Bandits
Online Algorithm for Unsupervised Sequential Selection with Contextual Information | Oct 23, 2020 | Multi-Armed Bandits
Online Allocation and Pricing: Constant Regret via Bellman Inequalities | Jun 14, 2019 | Multi-Armed Bandits
Online and Distribution-Free Robustness: Regression and Contextual Bandits with Huber Contamination | Oct 8, 2020 | Adversarial Robustness, Multi-Armed Bandits
Online and Scalable Model Selection with Multi-Armed Bandits | Jan 25, 2021 | BIG-bench Machine Learning, Model Selection
Online certification of preference-based fairness for personalized recommender systems | Apr 29, 2021 | Fairness, Multi-Armed Bandits
Online Continuous Hyperparameter Optimization for Generalized Linear Contextual Bandits | Feb 18, 2023 | Hyperparameter Optimization, Multi-Armed Bandits
Generalizable Meta-Heuristic based on Temporal Estimation of Rewards for Large Scale Blackbox Optimization | Dec 17, 2018 | Multi-Armed Bandits
Online Fair Division with Contextual Bandits | Aug 23, 2024 | Fairness, Multi-Armed Bandits
Online Fair Revenue Maximizing Cake Division with Non-Contiguous Pieces in Adversarial Bandits | Nov 29, 2021 | Fairness, Multi-Armed Bandits
Online Learning for Autonomous Management of Intent-based 6G Networks | Jul 25, 2024 | Efficient Exploration, Management
Online Learning for Cooperative Multi-Player Multi-Armed Bandits | Sep 7, 2021 | Multi-Armed Bandits
Online Learning in Contextual Bandits using Gated Linear Networks | Feb 21, 2020 | Multi-Armed Bandits
Online learning over a finite action set with limited switching | Mar 5, 2018 | Multi-Armed Bandits
Online Learning under Adversarial Corruptions | Jan 1, 2021 | Multi-Armed Bandits
Online Learning via the Differential Privacy Lens | Nov 27, 2017 | Multi-Armed Bandits
Online Learning with an Unknown Fairness Metric | Feb 20, 2018 | Fairness, Multi-Armed Bandits
Online learning with Corrupted context: Corrupted Contextual Bandits | Jun 26, 2020 | Multi-Armed Bandits
Online learning with feedback graphs and switching costs | Oct 23, 2018 | Multi-Armed Bandits
Online Learning with Off-Policy Feedback | Jul 18, 2022 | Decision Making, Multi-Armed Bandits
Online Limited Memory Neural-Linear Bandits | Jan 1, 2021 | Efficient Exploration, Multi-Armed Bandits