Reinforcement Learning for Physical Layer Communications Jun 22, 2021 Deep Reinforcement Learning Multi-Armed Bandits
Code Code Available 0BanditMF: Multi-Armed Bandit Based Matrix Factorization Recommender System Jun 21, 2021 Collaborative Filtering Multi-Armed Bandits
— Unverified 0Smooth Sequential Optimisation with Delayed Feedback Jun 21, 2021 Multi-Armed Bandits
— Unverified 0Banker Online Mirror Descent Jun 16, 2021 Multi-Armed Bandits
— Unverified 0Guaranteed Fixed-Confidence Best Arm Identification in Multi-Armed Bandits: Simple Sequential Elimination Algorithms Jun 12, 2021 Multi-Armed Bandits
— Unverified 0Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective Jun 11, 2021 Model Selection Multi-Armed Bandits
— Unverified 0A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits Jun 10, 2021 Multi-Armed Bandits
— Unverified 0Fixed-Budget Best-Arm Identification in Structured Bandits Jun 9, 2021 Multi-Armed Bandits
— Unverified 0Scale Free Adversarial Multi Armed Bandits Jun 8, 2021 Multi-Armed Bandits
— Unverified 0Cooperative Stochastic Multi-agent Multi-armed Bandits Robust to Adversarial Corruptions Jun 8, 2021 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 0Generalized Linear Bandits with Local Differential Privacy Jun 7, 2021 Decision Making Multi-Armed Bandits
Code Code Available 1On Learning to Rank Long Sequences with Contextual Bandits Jun 7, 2021 Learning-To-Rank Multi-Armed Bandits
— Unverified 0Multi-facet Contextual Bandits: A Neural Network Perspective Jun 6, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Differentially Private Multi-Armed Bandits in the Shuffle Model Jun 5, 2021 Multi-Armed Bandits
— Unverified 0Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks Jun 5, 2021 Multi-Armed Bandits Recommendation Systems
— Unverified 0Fair Exploration via Axiomatic Bargaining Jun 4, 2021 Fairness Multi-Armed Bandits
— Unverified 0Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits Jun 4, 2021 Multi-Armed Bandits
— Unverified 0Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions Jun 4, 2021 Multi-Armed Bandits
— Unverified 0Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits Jun 3, 2021 Multi-Armed Bandits Off-policy evaluation
Code Code Available 1Addressing the Long-term Impact of ML Decisions via Policy Regret Jun 2, 2021 Multi-Armed Bandits
Code Code Available 0Invariant Policy Learning: A Causal Perspective Jun 1, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits May 21, 2021 Blocking Multi-Armed Bandits
— Unverified 0Parallelizing Contextual Bandits May 21, 2021 Decision Making Decision Making Under Uncertainty
— Unverified 0Diffusion Approximations for Thompson Sampling May 19, 2021 Multi-Armed Bandits Thompson Sampling
— Unverified 0Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks May 10, 2021 Efficient Exploration Multi-Armed Bandits
Code Code Available 1Combinatorial Multi-armed Bandits for Resource Allocation May 10, 2021 Multi-Armed Bandits
Code Code Available 0Stochastic Multi-Armed Bandits with Control Variates May 9, 2021 Multi-Armed Bandits
— Unverified 0Contextual Bandits with Sparse Data in Web setting May 6, 2021 Articles Dimensionality Reduction
— Unverified 0Policy Learning with Adaptively Collected Data May 5, 2021 Multi-Armed Bandits
Code Code Available 0Optimal Algorithms for Range Searching over Multi-Armed Bandits May 4, 2021 Multi-Armed Bandits
— Unverified 0Statistical Inference with M-Estimators on Adaptively Collected Data Apr 29, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Online certification of preference-based fairness for personalized recommender systems Apr 29, 2021 Fairness Multi-Armed Bandits
— Unverified 0Off-Policy Risk Assessment in Contextual Bandits Apr 18, 2021 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Censored Semi-Bandits for Resource Allocation Apr 12, 2021 Multi-Armed Bandits
— Unverified 0An Efficient Algorithm for Deep Stochastic Contextual Bandits Apr 12, 2021 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Leveraging Good Representations in Linear Contextual Bandits Apr 8, 2021 Multi-Armed Bandits
— Unverified 0Multinomial Logit Contextual Bandits: Provable Optimality and Practicality Mar 25, 2021 Multi-Armed Bandits
— Unverified 0Towards Optimal Algorithms for Multi-Player Bandits without Collision Sensing Information Mar 24, 2021 Multi-Armed Bandits
— Unverified 0Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism Mar 22, 2021 Imitation Learning Multi-Armed Bandits
— Unverified 0Deep Contextual Bandits for Fast Neighbor-Aided Initial Access in mmWave Cell-Free Networks Mar 17, 2021 Multi-Armed Bandits
— Unverified 0Encrypted Linear Contextual Bandit Mar 17, 2021 Decision Making Multi-Armed Bandits
— Unverified 0Nearest Neighbor Search Under Uncertainty Mar 8, 2021 Multi-Armed Bandits Representation Learning
— Unverified 0Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems Mar 8, 2021 Multi-Armed Bandits
— Unverified 0Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes Mar 7, 2021 Multi-Armed Bandits
— Unverified 0Fairness of Exposure in Stochastic Bandits Mar 3, 2021 Fairness Multi-Armed Bandits
— Unverified 0Local Clustering in Contextual Multi-Armed Bandits Feb 26, 2021 Clustering Multi-Armed Bandits
— Unverified 0Adapting to Misspecification in Contextual Bandits with Offline Regression Oracles Feb 26, 2021 Multi-Armed Bandits regression
— Unverified 0Online Multi-Armed Bandits with Adaptive Inference Feb 25, 2021 Causal Inference Decision Making
— Unverified 0Combinatorial Bandits under Strategic Manipulations Feb 25, 2021 Multi-Armed Bandits Recommendation Systems
Code Code Available 0Federated Multi-armed Bandits with Personalization Feb 25, 2021 Federated Learning Multi-Armed Bandits
Code Code Available 0