PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits | May 18, 2018 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Delegating via Quitting Games | Apr 20, 2018 | Multi-Armed Bandits | Unverified | 0
Combining Difficulty Ranking with Multi-Armed Bandits to Sequence Educational Content | Apr 14, 2018 | Multi-Armed Bandits | Unverified | 0
Best arm identification in multi-armed bandits with delayed feedback | Mar 29, 2018 | Hyperparameter Optimization, Multi-Armed Bandits | Unverified | 0
What Doubling Tricks Can and Can't Do for Multi-Armed Bandits | Mar 19, 2018 | Multi-Armed Bandits, Reinforcement Learning | Unverified | 0
Semiparametric Contextual Bandits | Mar 12, 2018 | Multi-Armed Bandits | Code Available | 0
Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback | Mar 11, 2018 | Multi-Armed Bandits, Q-Learning | Unverified | 0
Online learning over a finite action set with limited switching | Mar 5, 2018 | Multi-Armed Bandits | Unverified | 0
Practical Contextual Bandits with Regression Oracles | Mar 3, 2018 | General Classification, Multi-Armed Bandits | Unverified | 0
The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates | Mar 1, 2018 | Multi-Armed Bandits | Unverified | 0
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0
Contextual Bandits with Stochastic Experts | Feb 23, 2018 | Multi-Armed Bandits | Code Available | 0
Regional Multi-Armed Bandits | Feb 22, 2018 | Multi-Armed Bandits | Unverified | 0
Online Learning with an Unknown Fairness Metric | Feb 20, 2018 | Fairness, Multi-Armed Bandits | Unverified | 0
Multi-Armed Bandits on Partially Revealed Unit Interval Graphs | Feb 12, 2018 | Multi-Armed Bandits | Unverified | 0
Policy Gradients for Contextual Recommendations | Feb 12, 2018 | Decision Making, Multi-Armed Bandits | Unverified | 0
More Robust Doubly Robust Off-policy Evaluation | Feb 10, 2018 | Multi-Armed Bandits, Off-policy evaluation | Unverified | 0
Make the Minority Great Again: First-Order Regret Bound for Contextual Bandits | Feb 9, 2018 | Multi-Armed Bandits | Unverified | 0
Nonparametric Stochastic Contextual Bandits | Jan 5, 2018 | General Classification, image-classification | Unverified | 0
Contextual memory bandit for pro-active dialog engagement | Jan 1, 2018 | Multi-Armed Bandits | Unverified | 0
Residual Loss Prediction: Reinforcement Learning With No Incremental Feedback | Jan 1, 2018 | Multi-Armed Bandits, Prediction | Code Available | 0
Learning Structural Weight Uncertainty for Sequential Decision-Making | Dec 30, 2017 | Decision Making, Multi-Armed Bandits | Code Available | 0
Active Search for High Recall: a Non-Stationary Extension of Thompson Sampling | Dec 27, 2017 | Multi-Armed Bandits, Thompson Sampling | Unverified | 0
Stochastic Multi-armed Bandits in Constant Space | Dec 25, 2017 | Multi-Armed Bandits | Unverified | 0
Gaussian Process bandits with adaptive discretization | Dec 5, 2017 | Multi-Armed Bandits | Unverified | 0
A KL-LUCB algorithm for Large-Scale Crowdsourcing | Dec 1, 2017 | Multi-Armed Bandits | Unverified | 0
Online Learning via the Differential Privacy Lens | Nov 27, 2017 | Multi-Armed Bandits | Unverified | 0
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models | Nov 22, 2017 | Multi-Armed Bandits, Response Generation | Unverified | 0
Estimation Considerations in Contextual Bandits | Nov 19, 2017 | Causal Inference, Econometrics | Unverified | 0
Budget-Constrained Multi-Armed Bandits with Multiple Plays | Nov 16, 2017 | Multi-Armed Bandits | Unverified | 0
Skyline Identification in Multi-Armed Bandits | Nov 12, 2017 | Multi-Armed Bandits | Unverified | 0
Small-loss bounds for online learning with partial information | Nov 9, 2017 | Multi-Armed Bandits | Unverified | 0
Multi-Player Bandits Revisited | Nov 7, 2017 | Multi-Armed Bandits | Unverified | 0
Sparsity, variance and curvature in multi-armed bandits | Nov 3, 2017 | Generalization Bounds, Learning Theory | Unverified | 0
Medoids in almost linear time via multi-armed bandits | Nov 2, 2017 | Multi-Armed Bandits | Code Available | 0
Multi-Armed Bandits with Metric Movement Costs | Oct 24, 2017 | Multi-Armed Bandits | Unverified | 0
Combinatorial Multi-armed Bandits for Real-Time Strategy Games | Oct 13, 2017 | Multi-Armed Bandits, Real-Time Strategy Games | Unverified | 0
An Analysis of the Value of Information when Exploring Stochastic, Discrete Multi-Armed Bandits | Oct 8, 2017 | Multi-Armed Bandits | Unverified | 0
Trend Detection based Regret Minimization for Bandit Problems | Sep 15, 2017 | Multi-Armed Bandits | Unverified | 0
Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks | Sep 13, 2017 | Decision Making, Multi-Armed Bandits | Unverified | 0
Variational inference for the multi-armed contextual bandit | Sep 10, 2017 | Multi-Armed Bandits, Reinforcement Learning | Code Available | 0
Ease.ml: Towards Multi-tenant Resource Sharing for Machine Learning Workloads | Aug 24, 2017 | Bayesian Optimization, BIG-bench Machine Learning | Unverified | 0
Efficient Contextual Bandits in Non-stationary Worlds | Aug 5, 2017 | Multi-Armed Bandits | Unverified | 0
Reinforcement learning techniques for Outer Loop Link Adaptation in 4G/5G systems | Aug 3, 2017 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0
Safety-Aware Algorithms for Adversarial Contextual Bandit | Aug 1, 2017 | Decision Making, Multi-Armed Bandits | Unverified | 0
A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity | Jul 28, 2017 | Multi-Armed Bandits, Reinforcement Learning | Unverified | 0
Nonlinear Sequential Accepts and Rejects for Identification of Top Arms in Stochastic Bandits | Jul 9, 2017 | Multi-Armed Bandits | Unverified | 0
Efficient Reinforcement Learning via Initial Pure Exploration | Jun 7, 2017 | Multi-Armed Bandits, reinforcement-learning | Unverified | 0
Nearly Optimal Sampling Algorithms for Combinatorial Pure Exploration | Jun 4, 2017 | Multi-Armed Bandits | Unverified | 0
Boltzmann Exploration Done Right | May 29, 2017 | Decision Making, Decision Making Under Uncertainty | Unverified | 0