Adaptive Endpointing with Deep Contextual Multi-armed Bandits Mar 23, 2023 Multi-Armed Bandits
— Unverified 00 Adaptive Exploration in Linear Contextual Bandit Oct 15, 2019 Decision Making Multi-Armed Bandits
— Unverified 00 Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds Mar 1, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 Adaptively Learning to Select-Rank in Online Platforms Jun 7, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 00 Adaptive Regret for Bandits Made Possible: Two Queries Suffice Jan 17, 2024 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 Adaptive, Robust and Scalable Bayesian Filtering for Online Learning May 12, 2025 Continual Learning Multi-Armed Bandits
— Unverified 00 ADARES: Adaptive Resource Management for Virtual Machines Dec 5, 2018 Management Multi-Armed Bandits
— Unverified 00 A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Feb 22, 2024 Language Modeling Language Modelling
— Unverified 00 Bandits with Knapsacks beyond the Worst-Case Feb 1, 2020 Multi-Armed Bandits
— Unverified 00 Adversarial Attacks on Adversarial Bandits Jan 30, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Adversarial Attacks on Cooperative Multi-agent Bandits Nov 3, 2023 Multi-Armed Bandits
— Unverified 00 Adversarial Attacks on Linear Contextual Bandits Feb 10, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 00 Adversarial Bandits with Knapsacks Nov 28, 2018 Multi-Armed Bandits Scheduling
— Unverified 00 Adversarial Contextual Bandits Go Kernelized Oct 2, 2023 Decision Making Multi-Armed Bandits
— Unverified 00 Adversarial Linear Contextual Bandits with Graph-Structured Side Observations Dec 10, 2020 Multi-Armed Bandits
— Unverified 00 α-Fair Contextual Bandits Oct 22, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 00 A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option Mar 6, 2020 Decision Making Multi-Armed Bandits
— Unverified 00 A Federated Online Restless Bandit Framework for Cooperative Resource Allocation Jun 12, 2024 Federated Learning Multi-Armed Bandits
— Unverified 00 A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback Jan 30, 2023 Multi-Armed Bandits
— Unverified 00 A framework for optimizing COVID-19 testing policy using a Multi Armed Bandit approach Jul 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 00 A Gang of Bandits Jun 4, 2013 Clustering Multi-Armed Bandits
— Unverified 00 A General Framework for Bandit Problems Beyond Cumulative Objectives Jun 4, 2018 Multi-Armed Bandits
— Unverified 00 A General Framework for Off-Policy Learning with Partially-Observed Reward Jun 17, 2025 Multi-Armed Bandits
— Unverified 00 A General Theory of the Stochastic Linear Bandit and Its Applications Feb 12, 2020 Multi-Armed Bandits Thompson Sampling
— Unverified 00 A Hierarchical Nearest Neighbour Approach to Contextual Bandits Dec 14, 2023 Computational Efficiency Multi-Armed Bandits
— Unverified 00 A Hybrid Meta-Learning and Multi-Armed Bandit Approach for Context-Specific Multi-Objective Recommendation Optimization Sep 13, 2024 Meta-Learning Multi-Armed Bandits
— Unverified 00 A KL-LUCB algorithm for Large-Scale Crowdsourcing Dec 1, 2017 Multi-Armed Bandits
— Unverified 00 Algorithms for Differentially Private Multi-Armed Bandits Nov 27, 2015 Multi-Armed Bandits
— Unverified 00 Algorithms for multi-armed bandit problems Feb 25, 2014 Multi-Armed Bandits
— Unverified 00 Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits Apr 27, 2015 Multi-Armed Bandits
— Unverified 00 Almost Boltzmann Exploration Jan 25, 2019 Multi-Armed Bandits Reinforcement Learning
— Unverified 00 Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits Oct 15, 2021 Multi-Armed Bandits
— Unverified 00 A Model Selection Approach for Corruption Robust Reinforcement Learning Oct 7, 2021 Model Selection Multi-Armed Bandits
— Unverified 00 An Adaptive Method for Contextual Stochastic Multi-armed Bandits with Rewards Generated by a Linear Dynamical System Jun 14, 2024 Multi-Armed Bandits
— Unverified 00 Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits Oct 23, 2021 Decision Making Multi-Armed Bandits
— Unverified 00 An Analysis of Reinforcement Learning for Malaria Control Jul 19, 2021 Multi-Armed Bandits OpenAI Gym
— Unverified 00 An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits Oct 8, 2017 Multi-Armed Bandits
— Unverified 00 A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits Aug 27, 2019 Change Detection Multi-Armed Bandits
— Unverified 00 An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives Jun 10, 2015 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 00 An Efficient Algorithm for Deep Stochastic Contextual Bandits Apr 12, 2021 Multi-Armed Bandits Stochastic Optimization
— Unverified 00 An Empirical Evaluation of Federated Contextual Bandit Algorithms Mar 17, 2023 Federated Learning Multi-Armed Bandits
— Unverified 00 An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 00 A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free Feb 3, 2019 Multi-Armed Bandits
— Unverified 00 A New Benchmark for Online Learning with Budget-Balancing Constraints Mar 19, 2025 Multi-Armed Bandits
— Unverified 00 An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System Apr 4, 2025 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 00 An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits Oct 29, 2023 Multi-Armed Bandits
— Unverified 00 An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit Nov 8, 2021 Multi-Armed Bandits
— Unverified 00 An Instrumental Value for Data Production and its Application to Data Pricing Dec 24, 2024 Decision Making Multi-Armed Bandits
— Unverified 00 An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays Oct 14, 2019 Multi-Armed Bandits
— Unverified 00 Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits Jul 19, 2018 Multi-Armed Bandits Thompson Sampling
— Unverified 00