Output-Weighted Sampling for Multi-Armed Bandits with Extreme Payoffs | Feb 19, 2021 | Decision Making, Gaussian Processes | Code Available | 0
Top-k eXtreme Contextual Bandits with Arm Hierarchy | Feb 15, 2021 | Computational Efficiency, Extreme Multi-Label Classification | Code Available | 0
Meta-Thompson Sampling | Feb 11, 2021 | Efficient Exploration, Meta-Learning | Unverified | 0
Multi-Agent Multi-Armed Bandits with Limited Communication | Feb 10, 2021 | Multi-Armed Bandits | Unverified | 0
Non-stationary Reinforcement Learning without Prior Knowledge: An Optimal Black-box Approach | Feb 10, 2021 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0
Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits | Feb 10, 2021 | Multi-Armed Bandits, regression | Unverified | 0
Player Modeling via Multi-Armed Bandits | Feb 10, 2021 | Multi-Armed Bandits | Unverified | 0
Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap | Feb 9, 2021 | Multi-Armed Bandits | Unverified | 0
Bandits for Learning to Explain from Explanations | Feb 7, 2021 | Gaussian Processes, Multi-Armed Bandits | Unverified | 0
Online Limited Memory Neural-Linear Bandits with Likelihood Matching | Feb 7, 2021 | Efficient Exploration, Multi-Armed Bandits | Code Available | 0
Confidence-Budget Matching for Sequential Budgeted Learning | Feb 5, 2021 | Decision Making, Decision Making Under Uncertainty | Unverified | 0
Transfer Learning in Bandits with Latent Continuity | Feb 4, 2021 | Multi-Armed Bandits, Transfer Learning | Unverified | 0
Recurrent Submodular Welfare and Matroid Blocking Bandits | Jan 30, 2021 | Blocking, Multi-Armed Bandits | Unverified | 0
Federated Multi-Armed Bandits | Jan 28, 2021 | Federated Learning, Multi-Armed Bandits | Code Available | 1
Personalization Paradox in Behavior Change Apps: Lessons from a Social Comparison-Based Personalized App for Physical Activity | Jan 25, 2021 | Multi-Armed Bandits | Unverified | 0
Online and Scalable Model Selection with Multi-Armed Bandits | Jan 25, 2021 | BIG-bench Machine Learning, Model Selection | Unverified | 0
Efficient Pure Exploration for Combinatorial Bandits with Semi-Bandit Feedback | Jan 21, 2021 | Multi-Armed Bandits | Unverified | 0
An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine Learning, Decision Making | Code Available | 1
Minimax Off-Policy Evaluation for Multi-Armed Bandits | Jan 19, 2021 | Multi-Armed Bandits, Off-policy evaluation | Unverified | 0
Resource Allocation in NOMA-based Self-Organizing Networks using Stochastic Multi-Armed Bandits | Jan 16, 2021 | Management, Multi-Armed Bandits | Unverified | 0
Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information | Jan 12, 2021 | Multi-Armed Bandits | Unverified | 0
Be Greedy in Multi-Armed Bandits | Jan 4, 2021 | Multi-Armed Bandits | Unverified | 0
Online Limited Memory Neural-Linear Bandits | Jan 1, 2021 | Efficient Exploration, Multi-Armed Bandits | Unverified | 0
Online Learning under Adversarial Corruptions | Jan 1, 2021 | Multi-Armed Bandits | Unverified | 0
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation | Dec 31, 2020 | Combinatorial Optimization, Multi-Armed Bandits | Unverified | 0
Learning to Optimize Energy Efficiency in Energy Harvesting Wireless Sensor Networks | Dec 30, 2020 | Multi-Armed Bandits | Unverified | 0
Lifelong Learning in Multi-Armed Bandits | Dec 28, 2020 | Lifelong learning, Multi-Armed Bandits | Unverified | 0
A Regret bound for Non-stationary Multi-Armed Bandits with Fairness Constraints | Dec 24, 2020 | Decision Making, Fairness | Unverified | 0
Expanding on Repeated Consumer Search Using Multi-Armed Bandits and Secretaries | Dec 22, 2020 | Multi-Armed Bandits | Unverified | 0
Relational Boosted Bandits | Dec 16, 2020 | Attribute, Descriptive | Code Available | 0
A One-Size-Fits-All Solution to Conservative Bandit Problems | Dec 14, 2020 | All, Multi-Armed Bandits | Unverified | 0
Active Feature Selection for the Mutual Information Criterion | Dec 13, 2020 | feature selection, Multi-Armed Bandits | Code Available | 0
Adversarial Linear Contextual Bandits with Graph-Structured Side Observations | Dec 10, 2020 | Multi-Armed Bandits | Unverified | 0
Streaming Algorithms for Stochastic Multi-armed Bandits | Dec 9, 2020 | Multi-Armed Bandits, Open-Ended Question Answering | Unverified | 0
Efficient Automatic CASH via Rising Bandits | Dec 8, 2020 | AutoML, Bayesian Optimization | Unverified | 0
Accurate and Fast Federated Learning via Combinatorial Multi-Armed Bandits | Dec 6, 2020 | BIG-bench Machine Learning, Federated Learning | Unverified | 0
Distributed Thompson Sampling | Dec 3, 2020 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Neural Contextual Bandits with Deep Representation and Shallow Exploration | Dec 3, 2020 | Multi-Armed Bandits, Representation Learning | Unverified | 0
Finding All ε-Good Arms in Stochastic Bandits | Dec 1, 2020 | All, Multi-Armed Bandits | Unverified | 0
Batched Coarse Ranking in Multi-Armed Bandits | Dec 1, 2020 | Multi-Armed Bandits | Unverified | 0
BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits | Dec 1, 2020 | Clustering, Multi-Armed Bandits | Code Available | 1
Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms | Dec 1, 2020 | Multi-Armed Bandits | Code Available | 0
A Tractable Online Learning Algorithm for the Multinomial Logit Contextual Bandit | Nov 28, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0
Resonance: Replacing Software Constants with Context-Aware Models in Real-time Communication | Nov 23, 2020 | Friction, Multi-Armed Bandits | Unverified | 0
Fully Gap-Dependent Bounds for Multinomial Logit Bandit | Nov 19, 2020 | Multi-Armed Bandits | Unverified | 0
Reward Biased Maximum Likelihood Estimation for Reinforcement Learning | Nov 16, 2020 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0
A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Nov 16, 2020 | Decision Making, Efficient Exploration | Code Available | 0
Improving Offline Contextual Bandits with Distributional Robustness | Nov 13, 2020 | counterfactual, Multi-Armed Bandits | Unverified | 0
Metric-Free Individual Fairness with Cooperative Contextual Bandits | Nov 13, 2020 | Decision Making, Fairness | Unverified | 0
Active Reinforcement Learning: Observing Rewards at a Cost | Nov 13, 2020 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0