Title | Date | Tags | Status
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | Mar 22, 2021 | Imitation Learning, Multi-Armed Bandits | Unverified
Encrypted Linear Contextual Bandit | Mar 17, 2021 | Decision Making, Multi-Armed Bandits | Unverified
Deep Contextual Bandits for Fast Neighbor-Aided Initial Access in mmWave Cell-Free Networks | Mar 17, 2021 | Multi-Armed Bandits | Unverified
Efficient Algorithms for Finite Horizon and Streaming Restless Multi-Armed Bandit Problems | Mar 8, 2021 | Multi-Armed Bandits | Unverified
Nearest Neighbor Search Under Uncertainty | Mar 8, 2021 | Multi-Armed Bandits, Representation Learning | Unverified
Selective Intervention Planning using Restless Multi-Armed Bandits to Improve Maternal and Child Health Outcomes | Mar 7, 2021 | Multi-Armed Bandits | Unverified
Fairness of Exposure in Stochastic Bandits | Mar 3, 2021 | Fairness, Multi-Armed Bandits | Unverified
Adapting to Misspecification in Contextual Bandits with Offline Regression Oracles | Feb 26, 2021 | Multi-Armed Bandits, regression | Unverified
Local Clustering in Contextual Multi-Armed Bandits | Feb 26, 2021 | Clustering, Multi-Armed Bandits | Unverified
Federated Multi-armed Bandits with Personalization | Feb 25, 2021 | Federated Learning, Multi-Armed Bandits | Code Available
Online Multi-Armed Bandits with Adaptive Inference | Feb 25, 2021 | Causal Inference, Decision Making | Unverified
Combinatorial Bandits under Strategic Manipulations | Feb 25, 2021 | Multi-Armed Bandits, Recommendation Systems | Code Available
Output-Weighted Sampling for Multi-Armed Bandits with Extreme Payoffs | Feb 19, 2021 | Decision Making, Gaussian Processes | Code Available
Top-k eXtreme Contextual Bandits with Arm Hierarchy | Feb 15, 2021 | Computational Efficiency, Extreme Multi-Label Classification | Code Available
Meta-Thompson Sampling | Feb 11, 2021 | Efficient Exploration, Meta-Learning | Unverified
Player Modeling via Multi-Armed Bandits | Feb 10, 2021 | Multi-Armed Bandits | Unverified
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach | Feb 10, 2021 | Multi-Armed Bandits, reinforcement-learning | Unverified
Multi-Agent Multi-Armed Bandits with Limited Communication | Feb 10, 2021 | Multi-Armed Bandits | Unverified
Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits | Feb 10, 2021 | Multi-Armed Bandits, regression | Unverified
Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap | Feb 9, 2021 | Multi-Armed Bandits | Unverified
Online Limited Memory Neural-Linear Bandits with Likelihood Matching | Feb 7, 2021 | Efficient Exploration, Multi-Armed Bandits | Code Available
Bandits for Learning to Explain from Explanations | Feb 7, 2021 | Gaussian Processes, Multi-Armed Bandits | Unverified
Confidence-Budget Matching for Sequential Budgeted Learning | Feb 5, 2021 | Decision Making, Decision Making Under Uncertainty | Unverified
Transfer Learning in Bandits with Latent Continuity | Feb 4, 2021 | Multi-Armed Bandits, Transfer Learning | Unverified
Recurrent Submodular Welfare and Matroid Blocking Bandits | Jan 30, 2021 | Blocking, Multi-Armed Bandits | Unverified
Personalization Paradox in Behavior Change Apps: Lessons from a Social Comparison-Based Personalized App for Physical Activity | Jan 25, 2021 | Multi-Armed Bandits | Unverified
Online and Scalable Model Selection with Multi-Armed Bandits | Jan 25, 2021 | BIG-bench Machine Learning, Model Selection | Unverified
Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback | Jan 21, 2021 | Multi-Armed Bandits | Unverified
Minimax Off-Policy Evaluation for Multi-Armed Bandits | Jan 19, 2021 | Multi-Armed Bandits, Off-policy evaluation | Unverified
Resource Allocation in NOMA-based Self-Organizing Networks using Stochastic Multi-Armed Bandits | Jan 16, 2021 | Management, Multi-Armed Bandits | Unverified
Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information | Jan 12, 2021 | Multi-Armed Bandits | Unverified
Be Greedy in Multi-Armed Bandits | Jan 4, 2021 | Multi-Armed Bandits | Unverified
Online Learning under Adversarial Corruptions | Jan 1, 2021 | Multi-Armed Bandits | Unverified
Online Limited Memory Neural-Linear Bandits | Jan 1, 2021 | Efficient Exploration, Multi-Armed Bandits | Unverified
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation | Dec 31, 2020 | Combinatorial Optimization, Multi-Armed Bandits | Unverified
Learning to Optimize Energy Efficiency in Energy Harvesting Wireless Sensor Networks | Dec 30, 2020 | Multi-Armed Bandits | Unverified
Lifelong Learning in Multi-Armed Bandits | Dec 28, 2020 | Lifelong learning, Multi-Armed Bandits | Unverified
A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints | Dec 24, 2020 | Decision Making, Fairness | Unverified
Expanding on Repeated Consumer Search Using Multi-Armed Bandits and Secretaries | Dec 22, 2020 | Multi-Armed Bandits | Unverified
Relational Boosted Bandits | Dec 16, 2020 | Attribute, Descriptive | Code Available
A One-Size-Fits-All Solution to Conservative Bandit Problems | Dec 14, 2020 | All, Multi-Armed Bandits | Unverified
Active Feature Selection for the Mutual Information Criterion | Dec 13, 2020 | feature selection, Multi-Armed Bandits | Code Available
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations | Dec 10, 2020 | Multi-Armed Bandits | Unverified
Streaming Algorithms for Stochastic Multi-armed Bandits | Dec 9, 2020 | Multi-Armed Bandits, Open-Ended Question Answering | Unverified
Efficient Automatic CASH via Rising Bandits | Dec 8, 2020 | AutoML, Bayesian Optimization | Unverified
Accurate and Fast Federated Learning via Combinatorial Multi-Armed Bandits | Dec 6, 2020 | BIG-bench Machine Learning, Federated Learning | Unverified
Neural Contextual Bandits with Deep Representation and Shallow Exploration | Dec 3, 2020 | Multi-Armed Bandits, Representation Learning | Unverified
Distributed Thompson Sampling | Dec 3, 2020 | Multi-Armed Bandits, Thompson Sampling | Unverified
Batched Coarse Ranking in Multi-Armed Bandits | Dec 1, 2020 | Multi-Armed Bandits | Unverified
Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms | Dec 1, 2020 | Multi-Armed Bandits | Code Available