Offline Contextual Bandits for Wireless Network Optimization Nov 11, 2021 Computational Efficiency Multi-Armed Bandits
— Unverified 0Universal and data-adaptive algorithms for model selection in linear contextual bandits Nov 8, 2021 Diversity Model Selection
— Unverified 0An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit Nov 8, 2021 Multi-Armed Bandits
— Unverified 0Empirical analysis of representation learning and exploration in neural kernel bandits Nov 5, 2021 Bayesian Inference Decision Making
Code Code Available 0Privacy-Preserving Communication-Efficient Federated Multi-Armed Bandits Nov 2, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Bandits Don’t Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Nov 1, 2021 Machine Translation Multi-Armed Bandits
— Unverified 0Decentralized Cooperative Reinforcement Learning with Hierarchical Information Structure Nov 1, 2021 Multi-agent Reinforcement Learning Multi-Armed Bandits
— Unverified 0(Almost) Free Incentivized Exploration from Decentralized Learning Agents Oct 27, 2021 Multi-Armed Bandits
Code Code Available 0Federated Linear Contextual Bandits Oct 27, 2021 Multi-Armed Bandits
— Unverified 0Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization Oct 27, 2021 Efficient Exploration Multi-Armed Bandits
Code Code Available 0The Pareto Frontier of model selection for general Contextual Bandits Oct 25, 2021 Model Selection Multi-Armed Bandits
— Unverified 0Linear Contextual Bandits with Adversarial Corruptions Oct 25, 2021 Multi-Armed Bandits
— Unverified 0Towards the D-Optimal Online Experiment Design for Recommender Selection Oct 23, 2021 Multi-Armed Bandits
Code Code Available 0Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits Oct 23, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Dynamic pricing and assortment under a contextual MNL demand Oct 19, 2021 Multi-Armed Bandits
— Unverified 0Stateful Offline Contextual Policy Evaluation and Learning Oct 19, 2021 Management Multi-Armed Bandits
— Unverified 0Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits Oct 16, 2021 Multi-Armed Bandits
— Unverified 0Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits Oct 15, 2021 Multi-Armed Bandits
— Unverified 0Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Oct 13, 2021 Machine Translation Multi-Armed Bandits
— Unverified 0Query-Reward Tradeoffs in Multi-Armed Bandits Oct 12, 2021 Multi-Armed Bandits
— Unverified 0Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection Oct 8, 2021 Multi-Armed Bandits
— Unverified 0A Model Selection Approach for Corruption Robust Reinforcement Learning Oct 7, 2021 Model Selection Multi-Armed Bandits
— Unverified 0Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning Oct 2, 2021 Multi-Armed Bandits regression
— Unverified 0Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adapting Bandit Algorithms for Settings with Sequentially Available Arms Sep 30, 2021 Management Multi-Armed Bandits
— Unverified 0Regularized-OFU: an efficient algorithm for general contextual bandit with optimization oracles Sep 29, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Causal Contextual Bandits with Targeted Interventions Sep 29, 2021 Multi-Armed Bandits
— Unverified 0Expected Improvement-based Contextual Bandits Sep 29, 2021 Bayesian Optimization Multi-Armed Bandits
— Unverified 0Batched Bandits with Crowd Externalities Sep 29, 2021 Multi-Armed Bandits
— Unverified 0Risk averse non-stationary multi-armed bandits Sep 28, 2021 Multi-Armed Bandits
— Unverified 0Robust Generalization of Quadratic Neural Networks via Function Identification Sep 22, 2021 Generalization Bounds Learning Theory
— Unverified 0Generalized Translation and Scale Invariant Online Algorithm for Adversarial Multi-Armed Bandits Sep 19, 2021 Multi-Armed Bandits Translation
— Unverified 0Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health Sep 16, 2021 Multi-Armed Bandits
— Unverified 0Estimation of Warfarin Dosage with Reinforcement Learning Sep 15, 2021 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Exploiting Heterogeneity in Robust Federated Best-Arm Identification Sep 13, 2021 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Misspecified Linear Markov Decision Processes Sep 12, 2021 Multi-Armed Bandits
— Unverified 0Best-Arm Identification in Correlated Multi-Armed Bandits Sep 10, 2021 Multi-Armed Bandits
— Unverified 0Online Learning for Cooperative Multi-Player Multi-Armed Bandits Sep 7, 2021 Multi-Armed Bandits
— Unverified 0Max-Utility Based Arm Selection Strategy For Sequential Query Recommendations Aug 31, 2021 Multi-Armed Bandits
— Unverified 0No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees Aug 23, 2021 Decision Making Decision Making Under Uncertainty
— Unverified 0Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models Aug 13, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function Aug 4, 2021 Multi-Armed Bandits
— Unverified 0Maximizing and Satisficing in Multi-armed Bandits with Graph Information Aug 2, 2021 Decision Making Multi-Armed Bandits
Code Code Available 0Indexability and Rollout Policy for Multi-State Partially Observable Restless Bandits Jul 30, 2021 Multi-Armed Bandits Recommendation Systems
— Unverified 0Combining Online Learning and Offline Learning for Contextual Bandits with Deficient Support Jul 24, 2021 Multi-Armed Bandits
— Unverified 0Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits Jul 23, 2021 Multi-Armed Bandits
Code Code Available 0From Predictions to Decisions: The Importance of Joint Predictive Distributions Jul 20, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0An Analysis of Reinforcement Learning for Malaria Control Jul 19, 2021 Multi-Armed Bandits OpenAI Gym
— Unverified 0