Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits Oct 16, 2021 Multi-Armed Bandits
— Unverified 0Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits Oct 15, 2021 Multi-Armed Bandits
— Unverified 0Bandits Don't Follow Rules: Balancing Multi-Facet Machine Translation with Multi-Armed Bandits Oct 13, 2021 Machine Translation Multi-Armed Bandits
— Unverified 0Query-Reward Tradeoffs in Multi-Armed Bandits Oct 12, 2021 Multi-Armed Bandits
— Unverified 0Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection Oct 8, 2021 Multi-Armed Bandits
— Unverified 0A Model Selection Approach for Corruption Robust Reinforcement Learning Oct 7, 2021 Model Selection Multi-Armed Bandits
— Unverified 0EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits Oct 7, 2021 Multi-Armed Bandits Thompson Sampling
Code Code Available 1Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning Oct 2, 2021 Multi-Armed Bandits regression
— Unverified 0Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Batched Thompson Sampling Oct 1, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Adapting Bandit Algorithms for Settings with Sequentially Available Arms Sep 30, 2021 Management Multi-Armed Bandits
— Unverified 0Causal Contextual Bandits with Targeted Interventions Sep 29, 2021 Multi-Armed Bandits
— Unverified 0Regularized-OFU: an efficient algorithm for general contextual bandit with optimization oracles Sep 29, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Expected Improvement-based Contextual Bandits Sep 29, 2021 Bayesian Optimization Multi-Armed Bandits
— Unverified 0Batched Bandits with Crowd Externalities Sep 29, 2021 Multi-Armed Bandits
— Unverified 0Risk averse non-stationary multi-armed bandits Sep 28, 2021 Multi-Armed Bandits
— Unverified 0Robust Generalization of Quadratic Neural Networks via Function Identification Sep 22, 2021 Generalization Bounds Learning Theory
— Unverified 0Generalized Translation and Scale Invariant Online Algorithm for Adversarial Multi-Armed Bandits Sep 19, 2021 Multi-Armed Bandits Translation
— Unverified 0Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-Profits in Improving Maternal and Child Health Sep 16, 2021 Multi-Armed Bandits
— Unverified 0Estimation of Warfarin Dosage with Reinforcement Learning Sep 15, 2021 Multi-Armed Bandits reinforcement-learning
Code Code Available 0Exploiting Heterogeneity in Robust Federated Best-Arm Identification Sep 13, 2021 Multi-Armed Bandits
— Unverified 0Improved Algorithms for Misspecified Linear Markov Decision Processes Sep 12, 2021 Multi-Armed Bandits
— Unverified 0Best-Arm Identification in Correlated Multi-Armed Bandits Sep 10, 2021 Multi-Armed Bandits
— Unverified 0Online Learning for Cooperative Multi-Player Multi-Armed Bandits Sep 7, 2021 Multi-Armed Bandits
— Unverified 0Max-Utility Based Arm Selection Strategy For Sequential Query Recommendations Aug 31, 2021 Multi-Armed Bandits
— Unverified 0No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees Aug 23, 2021 Decision Making Decision Making Under Uncertainty
— Unverified 0Batched Thompson Sampling for Multi-Armed Bandits Aug 15, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models Aug 13, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Regret Analysis of Learning-Based MPC with Partially-Unknown Cost Function Aug 4, 2021 Multi-Armed Bandits
— Unverified 0Maximizing and Satisficing in Multi-armed Bandits with Graph Information Aug 2, 2021 Decision Making Multi-Armed Bandits
Code Code Available 0Indexability and Rollout Policy for Multi-State Partially Observable Restless Bandits Jul 30, 2021 Multi-Armed Bandits Recommendation Systems
— Unverified 0Combining Online Learning and Offline Learning for Contextual Bandits with Deficient Support Jul 24, 2021 Multi-Armed Bandits
— Unverified 0Finite-time Analysis of Globally Nonstationary Multi-Armed Bandits Jul 23, 2021 Multi-Armed Bandits
Code Code Available 0From Predictions to Decisions: The Importance of Joint Predictive Distributions Jul 20, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0An Analysis of Reinforcement Learning for Malaria Control Jul 19, 2021 Multi-Armed Bandits OpenAI Gym
— Unverified 0GuideBoot: Guided Bootstrap for Deep Contextual Bandits Jul 18, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Inverse Contextual Bandits: Learning How Behavior Evolves over Time Jul 13, 2021 Benchmarking Decision Making
Code Code Available 0Adapting to Misspecification in Contextual Bandits Jul 12, 2021 Multi-Armed Bandits regression
— Unverified 0Model Selection for Generic Contextual Bandits Jul 7, 2021 model Model Selection
— Unverified 0Neural Contextual Bandits without Regret Jul 7, 2021 Decision Making Multi-Armed Bandits
Code Code Available 0Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination Jul 5, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Dueling Bandits with Adversarial Sleeping Jul 5, 2021 Management Multi-Armed Bandits
— Unverified 0Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning Jul 4, 2021 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Bayesian decision-making under misspecified priors with applications to meta-learning Jul 3, 2021 Decision Making Meta-Learning
— Unverified 0Regularized OFU: an Efficient UCB Estimator for Non-linear Contextual Bandit Jun 29, 2021 Multi-Armed Bandits
— Unverified 0Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits Jun 25, 2021 Descriptive Multi-Armed Bandits
— Unverified 0Multi-player Multi-armed Bandits with Collision-Dependent Reward Distributions Jun 25, 2021 Multi-Armed Bandits
— Unverified 0Random Effect Bandits Jun 23, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Q-Learning Lagrange Policies for Multi-Action Restless Bandits Jun 22, 2021 Multi-Armed Bandits Q-Learning
Code Code Available 0A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning Jun 22, 2021 Multi-Armed Bandits reinforcement-learning