Approximate Function Evaluation via Multi-Armed Bandits (Mar 18, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Reinforced Meta Active Learning (Mar 9, 2022). Tags: Active Learning, Informativeness. Status: Unverified.
Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits (Mar 8, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
PAC-Bayesian Lifelong Learning For Multi-Armed Bandits (Mar 7, 2022). Tags: Lifelong learning, Multi-Armed Bandits. Status: Unverified.
Restless Multi-Armed Bandits under Exogenous Global Markov Process (Feb 28, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Federated Online Sparse Decision Making (Feb 27, 2022). Tags: Decision Making, Multi-Armed Bandits. Status: Unverified.
Truncated LinUCB for Stochastic Linear Bandits (Feb 23, 2022). Tags: Multi-Armed Bandits. Status: Code Available.
The Pareto Frontier of Instance-Dependent Guarantees in Multi-Player Multi-Armed Bandits with no Communication (Feb 19, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Cost-Efficient Distributed Learning via Combinatorial Multi-Armed Bandits (Feb 16, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences (Feb 14, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Efficient Kernel UCB for Contextual Bandits (Feb 11, 2022). Tags: Computational Efficiency, Multi-Armed Bandits. Status: Code Available.
Shuffle Private Linear Contextual Bandits (Feb 11, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Settling the Communication Complexity for Distributed Offline Reinforcement Learning (Feb 10, 2022). Tags: Multi-Armed Bandits, Offline RL. Status: Unverified.
Remote Contextual Bandits (Feb 10, 2022). Tags: Marketing, Multi-Armed Bandits. Status: Unverified.
Smoothed Online Learning is as Easy as Statistical Learning (Feb 9, 2022). Tags: Learning Theory, Multi-Armed Bandits. Status: Unverified.
Budgeted Combinatorial Multi-Armed Bandits (Feb 8, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits (Feb 3, 2022). Tags: counterfactual, Multi-Armed Bandits. Status: Unverified.
Multi-armed Bandits for Link Configuration in Millimeter-wave Networks (Feb 2, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Adaptive Experimentation with Delayed Binary Feedback (Feb 2, 2022). Tags: Multi-Armed Bandits, valid. Status: Code Available.
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts (Feb 2, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health (Feb 2, 2022). Tags: Multi-Armed Bandits, Scheduling. Status: Unverified.
Context Uncertainty in Contextual Bandits with Applications to Recommender Systems (Feb 1, 2022). Tags: Multi-Armed Bandits, Recommendation Systems. Status: Unverified.
Evaluating Deep Vs. Wide & Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations (Jan 31, 2022). Tags: Multi-Armed Bandits, Thompson Sampling. Status: Code Available.
Neural Collaborative Filtering Bandits via Meta Learning (Jan 31, 2022). Tags: Collaborative Filtering, Decision Making. Status: Unverified.
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework (Jan 31, 2022). Tags: Bayesian Inference, Multi-Armed Bandits. Status: Code Available.
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms (Jan 30, 2022). Tags: Collaborative Filtering, Multi-Armed Bandits. Status: Unverified.
Top-K Ranking Deep Contextual Bandits for Information Selection Systems (Jan 28, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Networked Restless Multi-Armed Bandits for Mobile Interventions (Jan 28, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits (Jan 28, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Learning Neural Contextual Bandits Through Perturbed Rewards (Jan 24, 2022). Tags: Computational Efficiency, Multi-Armed Bandits. Status: Unverified.
Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search (Jan 21, 2022). Tags: Multi-Armed Bandits, Reinforcement Learning (RL). Status: Unverified.
Semantic Parsing for Planning Goals as Constrained Combinatorial Contextual Bandits (Jan 16, 2022). Tags: Multi-Armed Bandits, Semantic Parsing. Status: Unverified.
Contextual Bandits for Advertising Campaigns: A Diffusion-Model Independent Approach (Extended Version) (Jan 13, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Modelling Cournot Games as Multi-agent Multi-armed Bandits (Jan 1, 2022). Tags: Multi-Armed Bandits. Status: Unverified.
Off-Policy Evaluation Using Information Borrowing and Context-Based Switching (Dec 18, 2021). Tags: Multi-Armed Bandits, Off-policy evaluation. Status: Code Available.
Stochastic differential equations for limiting description of UCB rule for Gaussian multi-armed bandits (Dec 13, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
Safe Linear Leveling Bandits (Dec 13, 2021). Tags: Multi-Armed Bandits, Thompson Sampling. Status: Unverified.
Privacy Amplification via Shuffling for Linear Contextual Bandits (Dec 11, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
Efficient Action Poisoning Attacks on Linear Contextual Bandits (Dec 10, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
Best Arm Identification under Additive Transfer Bandits (Dec 8, 2021). Tags: Multi-Armed Bandits, Transfer Learning. Status: Unverified.
Contextual Bandit Applications in Customer Support Bot (Dec 6, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
On Submodular Contextual Bandits (Dec 3, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
Bandits with Knapsacks beyond the Worst Case (Dec 1, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
Identification of the Generalized Condorcet Winner in Multi-dueling Bandits (Dec 1, 2021). Tags: Multi-Armed Bandits. Status: Code Available.
Optimal Algorithms for Stochastic Contextual Preference Bandits (Dec 1, 2021). Tags: Decision Making, Information Retrieval. Status: Unverified.
Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning (Dec 1, 2021). Tags: Multi-Armed Bandits, Off-policy evaluation. Status: Code Available.
Multi-Armed Bandits with Bounded Arm-Memory: Near-Optimal Guarantees for Best-Arm Identification and Regret Minimization (Dec 1, 2021). Tags: Multi-Armed Bandits, Open-Ended Question Answering. Status: Unverified.
Asymptotically Best Causal Effect Identification with Multi-Armed Bandits (Dec 1, 2021). Tags: Multi-Armed Bandits. Status: Unverified.
Online Fair Revenue Maximizing Cake Division with Non-Contiguous Pieces in Adversarial Bandits (Nov 29, 2021). Tags: Fairness, Multi-Armed Bandits. Status: Unverified.
Decentralized Upper Confidence Bound Algorithms for Homogeneous Multi-Agent Multi-Armed Bandits (Nov 22, 2021). Tags: Multi-Armed Bandits. Status: Unverified.