Contextual Combinatorial Multi-armed Bandits with Volatile Arms and Submodular Reward Dec 1, 2018 Decision Making Multi-Armed Bandits
— Unverified 0A conversion theorem and minimax optimality for continuum contextual bandits Jun 9, 2024 Multi-Armed Bandits
— Unverified 0Contextual Information-Directed Sampling May 22, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Contextual Linear Bandits with Delay as Payoff Feb 18, 2025 Multi-Armed Bandits
— Unverified 0Contextual memory bandit for pro-active dialog engagement Jan 1, 2018 Multi-Armed Bandits
— Unverified 0Contextual Multi-Armed Bandits for Causal Marketing Oct 2, 2018 Causal Inference counterfactual
— Unverified 0Bandits Don’t Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Nov 1, 2021 Machine Translation Multi-Armed Bandits
— Unverified 0Contextual Multinomial Logit Bandits with General Value Functions Feb 12, 2024 Computational Efficiency Multi-Armed Bandits
— Unverified 0Contextual Online Decision Making with Infinite-Dimensional Functional Regression Jan 30, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Contextual Pandora's Box May 26, 2022 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Contextual Restless Multi-Armed Bandits with Application to Demand Response Decision-Making Mar 22, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Context Uncertainty in Contextual Bandits with Applications to Recommender Systems Feb 1, 2022 Multi-Armed Bandits Recommendation Systems
— Unverified 0Continuous K-Max Bandits Feb 19, 2025 Distributed Computing Multi-Armed Bandits
— Unverified 0Continuous-Time Multi-Armed Bandits with Controlled Restarts Jun 30, 2020 Multi-Armed Bandits
— Unverified 0Convex Hull Monte-Carlo Tree Search Mar 9, 2020 Multi-Armed Bandits
— Unverified 0Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs Aug 8, 2023 Multi-Armed Bandits
— Unverified 0Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions Jun 8, 2021 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 0Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms Jan 30, 2022 Collaborative Filtering Multi-Armed Bandits
— Unverified 0Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms Nov 8, 2022 Multi-Armed Bandits
— Unverified 0Coordination without communication: optimal regret in two players multi-armed bandits Feb 14, 2020 Multi-Armed Bandits Vocal Bursts Valence Prediction
— Unverified 0Bandits with Partially Observable Confounded Data Jun 11, 2020 Multi-Armed Bandits
— Unverified 0CorrAttack: Black-box Adversarial Attack with Structured Search Oct 3, 2020 Adversarial Attack Bayesian Optimization
— Unverified 0Bandits with Temporal Stochastic Constraints Nov 22, 2018 Multi-Armed Bandits
— Unverified 0Almost Boltzmann Exploration Jan 25, 2019 Multi-Armed Bandits Reinforcement Learning
— Unverified 0Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes Dec 12, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Corruption-robust exploration in episodic reinforcement learning Nov 20, 2019 Multi-Armed Bandits reinforcement-learning
— Unverified 0Context-Aware Bandits Oct 12, 2015 Clustering Multi-Armed Bandits
— Unverified 0Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning Jan 25, 2023 Multi-Armed Bandits
— Unverified 0Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models Nov 22, 2017 Multi-Armed Bandits Response Generation
— Unverified 0Query-Reward Tradeoffs in Multi-Armed Bandits Oct 12, 2021 Multi-Armed Bandits
— Unverified 0Data Acquisition for Improving Model Fairness using Reinforcement Learning Dec 4, 2024 Data Valuation Fairness
— Unverified 0Data Dependent Regret Guarantees Against General Comparators for Full or Bandit Feedback Mar 12, 2023 Multi-Armed Bandits
— Unverified 0Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits Jun 9, 2024 Decision Making Multi-Armed Bandits
— Unverified 0Data Poisoning Attacks in Contextual Bandits Aug 17, 2018 Data Poisoning Multi-Armed Bandits
— Unverified 0Data Poisoning Attacks on Stochastic Bandits May 16, 2019 Data Poisoning Multi-Armed Bandits
— Unverified 0DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees Oct 19, 2020 Attribute Decision Making
— Unverified 0Batched Nonparametric Bandits via k-Nearest Neighbor UCB May 15, 2025 Decision Making Marketing
— Unverified 0Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure Nov 1, 2021 Multi-agent Reinforcement Learning Multi-Armed Bandits
— Unverified 0Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Decentralized Exploration in Multi-Armed Bandits -- Extended version Nov 19, 2018 Multi-Armed Bandits
— Unverified 0Decentralized Upper Confidence Bound Algorithms for Homogeneous Multi-Agent Multi-Armed Bandits Nov 22, 2021 Multi-Armed Bandits
— Unverified 0Decentralized Multi-player Multi-armed Bandits with No Collision Information Feb 29, 2020 Multi-Armed Bandits
— Unverified 0Decentralized Smart Charging of Large-Scale EVs using Adaptive Multi-Agent Multi-Armed Bandits Jul 20, 2023 Fairness Multi-Armed Bandits
— Unverified 0Decision Automation for Electric Power Network Recovery Oct 1, 2019 Decision Making Multi-Armed Bandits
— Unverified 0Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health Feb 2, 2022 Multi-Armed Bandits Scheduling
— Unverified 0Decision Making in Changing Environments: Robustness, Query-Based Learning, and Differential Privacy Jan 24, 2025 Decision Making Multi-Armed Bandits
— Unverified 0Asymptotic Instance-Optimal Algorithms for Interactive Decision Making Jun 6, 2022 Decision Making Multi-Armed Bandits
— Unverified 0Batch Ensemble for Variance Dependent Regret in Stochastic Bandits Sep 13, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Deep Contextual Bandits for Fast Neighbor-Aided Initial Access in mmWave Cell-Free Networks Mar 17, 2021 Multi-Armed Bandits
— Unverified 0Constrained Pure Exploration Multi-Armed Bandits with a Fixed Budget Nov 27, 2022 Attribute Multi-Armed Bandits
— Unverified 0