Multi-armed Bandits for Link Configuration in Millimeter-wave Networks | Feb 2, 2022 | Multi-Armed Bandits | Unverified 0
Adaptive Experimentation with Delayed Binary Feedback | Feb 2, 2022 | Multi-Armed Bandits, valid | Code Available 0
Scalable Decision-Focused Learning in Restless Multi-Armed Bandits with Application to Maternal and Child Health | Feb 2, 2022 | Multi-Armed Bandits, Scheduling | Unverified 0
Efficient Algorithms for Learning to Control Bandits with Unobserved Contexts | Feb 2, 2022 | Multi-Armed Bandits | Unverified 0
Context Uncertainty in Contextual Bandits with Applications to Recommender Systems | Feb 1, 2022 | Multi-Armed Bandits, Recommendation Systems | Unverified 0
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework | Jan 31, 2022 | Bayesian Inference, Multi-Armed Bandits | Code Available 0
Evaluating Deep Vs. Wide & Deep Learners As Contextual Bandits For Personalized Email Promo Recommendations | Jan 31, 2022 | Multi-Armed Bandits, Thompson Sampling | Code Available 0
Neural Collaborative Filtering Bandits via Meta Learning | Jan 31, 2022 | Collaborative Filtering, Decision Making | Unverified 0
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms | Jan 30, 2022 | Collaborative Filtering, Multi-Armed Bandits | Unverified 0
Networked Restless Multi-Armed Bandits for Mobile Interventions | Jan 28, 2022 | Multi-Armed Bandits | Unverified 0
Top-K Ranking Deep Contextual Bandits for Information Selection Systems | Jan 28, 2022 | Multi-Armed Bandits | Unverified 0
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits | Jan 28, 2022 | Multi-Armed Bandits | Unverified 0
Learning Neural Contextual Bandits Through Perturbed Rewards | Jan 24, 2022 | Computational Efficiency, Multi-Armed Bandits | Unverified 0
Occupancy Information Ratio: Infinite-Horizon, Information-Directed, Parameterized Policy Search | Jan 21, 2022 | Multi-Armed Bandits, Reinforcement Learning (RL) | Unverified 0
Semantic Parsing for Planning Goals as Constrained Combinatorial Contextual Bandits | Jan 16, 2022 | Multi-Armed Bandits, Semantic Parsing | Unverified 0
Contextual Bandits for Advertising Campaigns: A Diffusion-Model Independent Approach (Extended Version) | Jan 13, 2022 | Multi-Armed Bandits | Unverified 0
Modelling Cournot Games as Multi-agent Multi-armed Bandits | Jan 1, 2022 | Multi-Armed Bandits | Unverified 0
Off-Policy Evaluation Using Information Borrowing and Context-Based Switching | Dec 18, 2021 | Multi-Armed Bandits, Off-policy evaluation | Code Available 0
Stochastic differential equations for limiting description of UCB rule for Gaussian multi-armed bandits | Dec 13, 2021 | Multi-Armed Bandits | Unverified 0
Safe Linear Leveling Bandits | Dec 13, 2021 | Multi-Armed Bandits, Thompson Sampling | Unverified 0
Privacy Amplification via Shuffling for Linear Contextual Bandits | Dec 11, 2021 | Multi-Armed Bandits | Unverified 0
Efficient Action Poisoning Attacks on Linear Contextual Bandits | Dec 10, 2021 | Multi-Armed Bandits | Unverified 0
Best Arm Identification under Additive Transfer Bandits | Dec 8, 2021 | Multi-Armed Bandits, Transfer Learning | Unverified 0
Contextual Bandit Applications in Customer Support Bot | Dec 6, 2021 | Multi-Armed Bandits | Unverified 0
On Submodular Contextual Bandits | Dec 3, 2021 | Multi-Armed Bandits | Unverified 0
Optimal Algorithms for Stochastic Contextual Preference Bandits | Dec 1, 2021 | Decision Making, Information Retrieval | Unverified 0
Identification of the Generalized Condorcet Winner in Multi-dueling Bandits | Dec 1, 2021 | Multi-Armed Bandits | Code Available 0
Asymptotically Best Causal Effect Identification with Multi-Armed Bandits | Dec 1, 2021 | Multi-Armed Bandits | Unverified 0
Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning | Dec 1, 2021 | Multi-Armed Bandits, Off-policy evaluation | Code Available 0
Bandits with Knapsacks beyond the Worst Case | Dec 1, 2021 | Multi-Armed Bandits | Unverified 0
Multi-Armed Bandits with Bounded Arm-Memory: Near-Optimal Guarantees for Best-Arm Identification and Regret Minimization | Dec 1, 2021 | Multi-Armed Bandits, Open-Ended Question Answering | Unverified 0
Online Fair Revenue Maximizing Cake Division with Non-Contiguous Pieces in Adversarial Bandits | Nov 29, 2021 | Fairness, Multi-Armed Bandits | Unverified 0
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization | Nov 27, 2021 | Multi-Armed Bandits | Code Available 1
Decentralized Upper Confidence Bound Algorithms for Homogeneous Multi-Agent Multi-Armed Bandits | Nov 22, 2021 | Multi-Armed Bandits | Unverified 0
Offline Contextual Bandits for Wireless Network Optimization | Nov 11, 2021 | Computational Efficiency, Multi-Armed Bandits | Unverified 0
An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit | Nov 8, 2021 | Multi-Armed Bandits | Unverified 0
Universal and data-adaptive algorithms for model selection in linear contextual bandits | Nov 8, 2021 | Diversity, Model Selection | Unverified 0
Empirical analysis of representation learning and exploration in neural kernel bandits | Nov 5, 2021 | Bayesian Inference, Decision Making | Code Available 0
Privacy-Preserving Communication-Efficient Federated Multi-Armed Bandits | Nov 2, 2021 | Decision Making, Multi-Armed Bandits | Unverified 0
Bandits Don’t Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits | Nov 1, 2021 | Machine Translation, Multi-Armed Bandits | Unverified 0
Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure | Nov 1, 2021 | Multi-agent Reinforcement Learning, Multi-Armed Bandits | Unverified 0
(Almost) Free Incentivized Exploration from Decentralized Learning Agents | Oct 27, 2021 | Multi-Armed Bandits | Code Available 0
Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization | Oct 27, 2021 | Efficient Exploration, Multi-Armed Bandits | Code Available 0
Federated Linear Contextual Bandits | Oct 27, 2021 | Multi-Armed Bandits | Unverified 0
The Pareto Frontier of model selection for general Contextual Bandits | Oct 25, 2021 | Model Selection, Multi-Armed Bandits | Unverified 0
Linear Contextual Bandits with Adversarial Corruptions | Oct 25, 2021 | Multi-Armed Bandits | Unverified 0
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Oct 23, 2021 | Decision Making, Multi-Armed Bandits | Unverified 0
Towards the D-Optimal Online Experiment Design for Recommender Selection | Oct 23, 2021 | Multi-Armed Bandits | Code Available 0
Dynamic pricing and assortment under a contextual MNL demand | Oct 19, 2021 | Multi-Armed Bandits | Unverified 0
Stateful Offline Contextual Policy Evaluation and Learning | Oct 19, 2021 | Management, Multi-Armed Bandits | Unverified 0