Adaptive, Robust and Scalable Bayesian Filtering for Online Learning May 12, 2025 Continual Learning Multi-Armed Bandits
— Unverified 0Active Velocity Estimation using Light Curtains via Self-Supervised Multi-Armed Bandits Feb 24, 2023 Multi-Armed Bandits Navigate
— Unverified 0ADARES: Adaptive Resource Management for Virtual Machines Dec 5, 2018 Management Multi-Armed Bandits
— Unverified 0AdaLinUCB: Opportunistic Learning for Contextual Bandits Feb 20, 2019 Multi-Armed Bandits
— Unverified 0A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health Feb 22, 2024 Language Modeling Language Modelling
— Unverified 0Bandits with Knapsacks beyond the Worst-Case Feb 1, 2020 Multi-Armed Bandits
— Unverified 0Adversarial Attacks on Adversarial Bandits Jan 30, 2023 Multi-Armed Bandits Recommendation Systems
— Unverified 0Adapting Bandit Algorithms for Settings with Sequentially Available Arms Sep 30, 2021 Management Multi-Armed Bandits
— Unverified 0Adversarial Attacks on Cooperative Multi-agent Bandits Nov 3, 2023 Multi-Armed Bandits
— Unverified 0Adversarial Attacks on Linear Contextual Bandits Feb 10, 2020 Multi-Armed Bandits Recommendation Systems
— Unverified 0Adversarial Bandits with Knapsacks Nov 28, 2018 Multi-Armed Bandits Scheduling
— Unverified 0Adversarial Contextual Bandits Go Kernelized Oct 2, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Approximate Function Evaluation via Multi-Armed Bandits Mar 18, 2022 Multi-Armed Bandits
— Unverified 0A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits Jun 10, 2021 Multi-Armed Bandits
— Unverified 0Approximately Stationary Bandits with Knapsacks Feb 28, 2023 Multi-Armed Bandits
— Unverified 0A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning Aug 23, 2022 Multi-Armed Bandits reinforcement-learning
— Unverified 0A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits Oct 24, 2023 Change Point Detection Multi-Armed Bandits
— Unverified 0A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity Jul 28, 2017 Multi-Armed Bandits Reinforcement Learning
— Unverified 0An Optimistic Algorithm for Online Convex Optimization with Adversarial Constraints Dec 11, 2024 Multi-Armed Bandits
— Unverified 0Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds Mar 1, 2024 Decision Making Multi-Armed Bandits
— Unverified 0A General Reduction for High-Probability Analysis with General Light-Tailed Distributions Mar 5, 2024 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Active Inference for Autonomous Decision-Making with Contextual Multi-Armed Bandits Sep 19, 2022 Decision Making Decision Making Under Uncertainty
— Unverified 0Adaptive Exploration in Linear Contextual Bandit Oct 15, 2019 Decision Making Multi-Armed Bandits
— Unverified 0Accurate and Fast Federated Learning via Combinatorial Multi-Armed Bandits Dec 6, 2020 BIG-bench Machine Learning Federated Learning
— Unverified 0A Novel Approach to Balance Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes and its Implementation in BEACON Dec 23, 2024 Multi-Armed Bandits Nutrition
— Unverified 0A Bandit Approach to Sequential Experimental Design with False Discovery Control Dec 1, 2018 Drug Discovery Experimental Design
— Unverified 0Access Probability Optimization in RACH: A Multi-Armed Bandits Approach Apr 18, 2025 Multi-Armed Bandits
— Unverified 0An Optimal Algorithm for Multiplayer Multi-Armed Bandits Sep 28, 2019 Multi-Armed Bandits
— Unverified 0Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits Oct 15, 2021 Multi-Armed Bandits
— Unverified 0Adaptive Endpointing with Deep Contextual Multi-armed Bandits Mar 23, 2023 Multi-Armed Bandits
— Unverified 0A Correction of Pseudo Log-Likelihood Method Mar 26, 2024 Multi-Armed Bandits
— Unverified 0Almost Boltzmann Exploration Jan 25, 2019 Multi-Armed Bandits Reinforcement Learning
— Unverified 0A Model Selection Approach for Corruption Robust Reinforcement Learning Oct 7, 2021 Model Selection Multi-Armed Bandits
— Unverified 0Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits Apr 27, 2015 Multi-Armed Bandits
— Unverified 0An Adaptive Method for Contextual Stochastic Multi-armed Bandits with Rewards Generated by a Linear Dynamical System Jun 14, 2024 Multi-Armed Bandits
— Unverified 0Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits Oct 23, 2021 Decision Making Multi-Armed Bandits
— Unverified 0An Analysis of Reinforcement Learning for Malaria Control Jul 19, 2021 Multi-Armed Bandits OpenAI Gym
— Unverified 0An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits Oct 8, 2017 Multi-Armed Bandits
— Unverified 0A Near-Optimal Change-Detection Based Algorithm for Piecewise-Stationary Combinatorial Semi-Bandits Aug 27, 2019 Change Detection Multi-Armed Bandits
— Unverified 0An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives Jun 10, 2015 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 0An Efficient Algorithm for Deep Stochastic Contextual Bandits Apr 12, 2021 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Adaptive Discretization against an Adversary: Lipschitz bandits, Dynamic Pricing, and Auction Tuning Jun 22, 2020 Multi-Armed Bandits
— Unverified 0Active Reinforcement Learning: Observing Rewards at a Cost Nov 13, 2020 Multi-Armed Bandits reinforcement-learning
— Unverified 0An Empirical Evaluation of Thompson Sampling Dec 1, 2011 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adaptively Learning to Select-Rank in Online Platforms Jun 7, 2024 Multi-Armed Bandits Thompson Sampling
— Unverified 0A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free Feb 3, 2019 Multi-Armed Bandits
— Unverified 0A New Benchmark for Online Learning with Budget-Balancing Constraints Mar 19, 2025 Multi-Armed Bandits
— Unverified 0Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling Dec 27, 2017 Multi-Armed Bandits Thompson Sampling
— Unverified 0An Exploration-free Method for a Linear Stochastic Bandit Driven by a Linear Gaussian Dynamical System Apr 4, 2025 Hyperparameter Optimization Multi-Armed Bandits
— Unverified 0Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits Jul 19, 2018 Multi-Armed Bandits Thompson Sampling
— Unverified 0