A Survey on Contextual Multi-armed Bandits Aug 13, 2015 Multi-Armed Bandits Survey
Code Code Available 0Episodic Multi-armed Bandits Aug 4, 2015 Multi-Armed Bandits Reinforcement Learning
— Unverified 0Linear Contextual Bandits with Knapsacks Jul 24, 2015 Multi-Armed Bandits
— Unverified 0Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits Jul 16, 2015 Active Learning Multi-Armed Bandits
— Unverified 0Selecting the best system and multi-armed bandits Jul 16, 2015 Multi-Armed Bandits
— Unverified 0Scalable Discrete Sampling as a Multi-Armed Bandit Problem Jun 30, 2015 Bayesian Inference Multi-Armed Bandits
— Unverified 0An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives Jun 10, 2015 Multi-Armed Bandits Open-Ended Question Answering
— Unverified 0Regulating Greed Over Time in Multi-Armed Bandits May 21, 2015 Multi-Armed Bandits Time Series Analysis
Code Code Available 0On Regret-Optimal Learning in Decentralized Multi-player Multi-armed Bandits May 4, 2015 Multi-Armed Bandits
— Unverified 0Thompson Sampling for Budgeted Multi-armed Bandits May 1, 2015 Multi-Armed Bandits Thompson Sampling
— Unverified 0Algorithms with Logarithmic or Sublinear Regret for Constrained Contextual Bandits Apr 27, 2015 Multi-Armed Bandits
— Unverified 0Regret vs. Communication: Distributed Stochastic Multi-Armed Bandits and Beyond Apr 14, 2015 Multi-Armed Bandits
— Unverified 0Global Bandits Mar 29, 2015 Decision Making Informativeness
— Unverified 0Networked Stochastic Multi-Armed Bandits with Combinatorial Strategies Mar 20, 2015 Multi-Armed Bandits
— Unverified 0Doubly Robust Policy Evaluation and Optimization Mar 10, 2015 Decision Making Multi-Armed Bandits
— Unverified 0Learning to Search Better Than Your Teacher Feb 8, 2015 Multi-Armed Bandits Structured Prediction
— Unverified 0Learning Multiple Tasks in Parallel with a Shared Annotator Dec 1, 2014 Binary Classification Document Classification
— Unverified 0Combinatorial Pure Exploration of Multi-Armed Bandits Dec 1, 2014 Multi-Armed Bandits
— Unverified 0Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback Sep 30, 2014 Multi-Armed Bandits
— Unverified 0On Minimax Optimal Offline Policy Evaluation Sep 12, 2014 Multi-Armed Bandits Off-policy evaluation
— Unverified 0Bandits Warm-up Cold Recommender Systems Jul 10, 2014 Multi-Armed Bandits Recommendation Systems
— Unverified 0Unimodal Bandits: Regret Lower Bounds and Optimal Algorithms May 20, 2014 Multi-Armed Bandits
— Unverified 0Lipschitz Bandits: Regret Lower Bounds and Optimal Algorithms May 19, 2014 Multi-Armed Bandits
— Unverified 0Reducing Dueling Bandits to Cardinal Bandits May 14, 2014 Multi-Armed Bandits
— Unverified 0Adaptive Contract Design for Crowdsourcing Markets: Bandit Algorithms for Repeated Principal-Agent Problems May 12, 2014 Multi-Armed Bandits
— Unverified 0Generalized Risk-Aversion in Stochastic Multi-Armed Bandits May 5, 2014 Multi-Armed Bandits
— Unverified 0Resourceful Contextual Bandits Feb 27, 2014 Multi-Armed Bandits
— Unverified 0Algorithms for multi-armed bandit problems Feb 25, 2014 Multi-Armed Bandits
— Unverified 0Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits Feb 4, 2014 General Classification Multi-Armed Bandits
Code Code Available 0Exploration vs Exploitation vs Safety: Risk-averse Multi-Armed Bandits Jan 6, 2014 energy management Management
— Unverified 0lil' UCB : An Optimal Exploration Algorithm for Multi-Armed Bandits Dec 27, 2013 Multi-Armed Bandits
— Unverified 0Fundamental Limits of Online and Distributed Algorithms for Statistical Learning and Estimation Nov 14, 2013 Multi-Armed Bandits Stochastic Optimization
— Unverified 0Distributed Exploration in Multi-Armed Bandits Nov 4, 2013 Multi-Armed Bandits
— Unverified 0Generalized Thompson Sampling for Contextual Bandits Oct 27, 2013 Multi-Armed Bandits Thompson Sampling
— Unverified 0Multi-Armed Bandits for Intelligent Tutoring Systems Oct 11, 2013 Multi-Armed Bandits
— Unverified 0Sequential Monte Carlo Bandits Oct 4, 2013 Multi-Armed Bandits
— Unverified 0Finite-Time Analysis of Kernelised Contextual Bandits Sep 26, 2013 Multi-Armed Bandits
— Unverified 0Building Bridges: Viewing Active Learning from the Multi-Armed Bandit Lens Sep 26, 2013 Active Learning Binary Classification
— Unverified 0Distributed Online Learning via Cooperative Contextual Bandits Aug 21, 2013 Event Detection Multi-Armed Bandits
— Unverified 0Modeling Human Decision-making in Generalized Gaussian Multi-armed Bandits Jul 23, 2013 Bayesian Inference Decision Making
— Unverified 0Towards Distribution-Free Multi-Armed Bandits with Combinatorial Strategies Jul 20, 2013 Multi-Armed Bandits
— Unverified 0From Bandits to Experts: A Tale of Domination and Independence Jul 17, 2013 Multi-Armed Bandits
— Unverified 0On Finding the Largest Mean Among Many Jun 17, 2013 Multi-Armed Bandits
— Unverified 0Concentration bounds for temporal difference learning with linear function approximation: The case of batch data and uniform sampling Jun 11, 2013 Multi-Armed Bandits News Recommendation
— Unverified 0A Gang of Bandits Jun 4, 2013 Clustering Multi-Armed Bandits
— Unverified 0Dynamic Ad Allocation: Bandits with Budgets Jun 1, 2013 Multi-Armed Bandits
— Unverified 0Exponentiated Gradient LINUCB for Contextual Multi-Armed Bandits May 10, 2013 Multi-Armed Bandits
— Unverified 0Hierarchical Optimistic Region Selection driven by Curiosity Dec 1, 2012 Active Learning Multi-Armed Bandits
— Unverified 0Risk-Aversion in Multi-armed Bandits Dec 1, 2012 Multi-Armed Bandits
— Unverified 0Thompson Sampling for Contextual Bandits with Linear Payoffs Sep 15, 2012 Multi-Armed Bandits Thompson Sampling
Code Code Available 0