GuideBoot: Guided Bootstrap for Deep Contextual Bandits Jul 18, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Inverse Contextual Bandits: Learning How Behavior Evolves over Time Jul 13, 2021 Benchmarking Decision Making
Code Code Available 0Adapting to Misspecification in Contextual Bandits Jul 12, 2021 Multi-Armed Bandits regression
— Unverified 0Neural Contextual Bandits without Regret Jul 7, 2021 Decision Making Multi-Armed Bandits
Code Code Available 0Model Selection for Generic Contextual Bandits Jul 7, 2021 model Model Selection
— Unverified 0Dueling Bandits with Adversarial Sleeping Jul 5, 2021 Management Multi-Armed Bandits
— Unverified 0Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination Jul 5, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Restless and Uncertain: Robust Policies for Restless Bandits via Deep Multi-Agent Reinforcement Learning Jul 4, 2021 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Bayesian decision-making under misspecified priors with applications to meta-learning Jul 3, 2021 Decision Making Meta-Learning
— Unverified 0Regularized OFU: an Efficient UCB Estimator forNon-linear Contextual Bandit Jun 29, 2021 Multi-Armed Bandits
— Unverified 0Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits Jun 25, 2021 Descriptive Multi-Armed Bandits
— Unverified 0Multi-player Multi-armed Bandits with Collision-Dependent Reward Distributions Jun 25, 2021 Multi-Armed Bandits
— Unverified 0Random Effect Bandits Jun 23, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Q-Learning Lagrange Policies for Multi-Action Restless Bandits Jun 22, 2021 Multi-Armed Bandits Q-Learning
Code Code Available 0A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning Jun 22, 2021 Multi-Armed Bandits reinforcement-learning
— Unverified 0Reinforcement Learning for Physical Layer Communications Jun 22, 2021 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 0BanditMF: Multi-Armed Bandit Based Matrix Factorization Recommender System Jun 21, 2021 Collaborative Filtering Multi-Armed Bandits
— Unverified 0Smooth Sequential Optimisation with Delayed Feedback Jun 21, 2021 Multi-Armed Bandits
— Unverified 0Banker Online Mirror Descent Jun 16, 2021 Multi-Armed Bandits
— Unverified 0Guaranteed Fixed-Confidence Best Arm Identification in Multi-Armed Bandits: Simple Sequential Elimination Algorithms Jun 12, 2021 Multi-Armed Bandits
— Unverified 0Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective Jun 11, 2021 Model Selection Multi-Armed Bandits
— Unverified 0A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits Jun 10, 2021 Multi-Armed Bandits
— Unverified 0Fixed-Budget Best-Arm Identification in Structured Bandits Jun 9, 2021 Multi-Armed Bandits
— Unverified 0Scale Free Adversarial Multi Armed Bandits Jun 8, 2021 Multi-Armed Bandits
— Unverified 0Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions Jun 8, 2021 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 0On Learning to Rank Long Sequences with Contextual Bandits Jun 7, 2021 Learning-To-Rank Multi-Armed Bandits
— Unverified 0Multi-facet Contextual Bandits: A Neural Network Perspective Jun 6, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks Jun 5, 2021 Multi-Armed Bandits Recommendation Systems
— Unverified 0Differentially Private Multi-Armed Bandits in the Shuffle Model Jun 5, 2021 Multi-Armed Bandits
— Unverified 0Fair Exploration via Axiomatic Bargaining Jun 4, 2021 Fairness Multi-Armed Bandits
— Unverified 0Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits Jun 4, 2021 Multi-Armed Bandits
— Unverified 0Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions Jun 4, 2021 Multi-Armed Bandits
— Unverified 0Addressing the Long-term Impact of ML Decisions via Policy Regret Jun 2, 2021 Multi-Armed Bandits
Code Code Available 0Invariant Policy Learning: A Causal Perspective Jun 1, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits May 21, 2021 Blocking Multi-Armed Bandits
— Unverified 0Parallelizing Contextual Bandits May 21, 2021 Decision Making Decision Making Under Uncertainty
— Unverified 0Diffusion Approximations for Thompson Sampling May 19, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Code Code Available 0Stochastic Multi-Armed Bandits with Control Variates May 9, 2021 Multi-Armed Bandits
— Unverified 0Contextual Bandits with Sparse Data in Web setting May 6, 2021 Articles Dimensionality Reduction
— Unverified 0Policy Learning with Adaptively Collected Data May 5, 2021 Multi-Armed Bandits
Code Code Available 0Optimal Algorithms for Range Searching over Multi-Armed Bandits May 4, 2021 Multi-Armed Bandits
— Unverified 0Statistical Inference with M-Estimators on Adaptively Collected Data Apr 29, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Online certification of preference-based fairness for personalized recommender systems Apr 29, 2021 Fairness Multi-Armed Bandits
— Unverified 0Off-Policy Risk Assessment in Contextual Bandits Apr 18, 2021 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Censored Semi-Bandits for Resource Allocation Apr 12, 2021 Multi-Armed Bandits
— Unverified 0An Efficient Algorithm for Deep Stochastic Contextual Bandits Apr 12, 2021 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Leveraging Good Representations in Linear Contextual Bandits Apr 8, 2021 Multi-Armed Bandits
— Unverified 0Multinomial Logit Contextual Bandits: Provable Optimality and Practicality Mar 25, 2021 Multi-Armed Bandits
— Unverified 0Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information Mar 24, 2021 Multi-Armed Bandits
— Unverified 0