| Deep Reinforcement Learning for Visual Object Tracking in Videos | Jan 31, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Model-Free Control of Thermostatically Controlled Loads Connected to a District Heating Network | Jan 27, 2017 | Decision Making, Reinforcement Learning | Unverified | 0 |
| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active Learning, Decision Making | Unverified | 0 |
| Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making | Jan 5, 2017 | Decision Making, Multi-Objective Reinforcement Learning | Unverified | 0 |
| Stochastic Planning and Lifted Inference | Jan 4, 2017 | Decision Making, Sequential Decision Making | Unverified | 0 |
| From Preference-Based to Multiobjective Sequential Decision-Making | Jan 3, 2017 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Multi-armed Bandits: Competing with Optimal Sequences | Dec 1, 2016 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Fast Video Classification via Adaptive Cascading of Deep Models | Nov 20, 2016 | Classification, CPU | Unverified | 0 |
| Open Problem: Approximate Planning of POMDPs in the class of Memoryless Policies | Aug 17, 2016 | Decision Making, Reinforcement Learning | Unverified | 0 |
| Human collective intelligence as distributed Bayesian inference | Aug 5, 2016 | Bayesian Inference, Decision Making | Unverified | 0 |
| Safe Policy Improvement by Minimizing Robust Baseline Regret | Jul 13, 2016 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Preference at First Sight | Jun 24, 2016 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Model-Free Episodic Control | Jun 14, 2016 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| The Bayesian Linear Information Filtering Problem | May 30, 2016 | Articles, Decision Making | Unverified | 0 |
| Deep Action Sequence Learning for Causal Shape Transformation | May 17, 2016 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Real-Time Web Scale Event Summarization Using Sequential Decision Making | May 12, 2016 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Stochastic Contextual Bandits with Known Reward Functions | Apr 30, 2016 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games | Apr 24, 2016 | Atari Games, Decision Making | Unverified | 0 |
| Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains | Mar 15, 2016 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| PAC Reinforcement Learning with Rich Observations | Feb 8, 2016 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| End-to-End Goal-Driven Web Navigation | Feb 6, 2016 | Decision Making, Question Answering | Code Available | 0 |
| Risk-Constrained Reinforcement Learning with Percentile Risk Criteria | Dec 5, 2015 | Decision Making, Marketing | Unverified | 0 |
| Reuse of Neural Modules for General Video Game Playing | Dec 4, 2015 | Atari Games, Decision Making | Unverified | 0 |
| Bandits with Unobserved Confounders: A Causal Approach | Dec 1, 2015 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version) | Nov 29, 2015 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice | Nov 29, 2015 | Decision Making, reinforcement-learning | Unverified | 0 |
| Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | Nov 11, 2015 | Decision Making, reinforcement-learning | Unverified | 0 |
| The Knowledge Gradient with Logistic Belief Models for Binary Classification | Oct 8, 2015 | Binary Classification, Classification | Unverified | 0 |
| Two Phase Q-learning for Bidding-based Vehicle Sharing | Sep 29, 2015 | Decision Making, Q-Learning | Unverified | 0 |
| Optimization of anemia treatment in hemodialysis patients via reinforcement learning | Sep 14, 2015 | Decision Making, Q-Learning | Unverified | 0 |
| Learning Efficient Representations for Reinforcement Learning | Aug 28, 2015 | Decision Making, reinforcement-learning | Unverified | 0 |
| Experimental analysis of data-driven control for a building heating system | Jul 13, 2015 | Decision Making, reinforcement-learning | Unverified | 0 |
| Utility-based Dueling Bandits as a Partial Monitoring Game | Jul 10, 2015 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Data Generation as Sequential Decision Making | Jun 10, 2015 | Decision Making, Imputation | Code Available | 0 |
| Hands-on Learning to Search for Structured Prediction | May 1, 2015 | Decision Making, Dependency Parsing | Unverified | 0 |
| Global Bandits | Mar 29, 2015 | Decision Making, Informativeness | Unverified | 0 |
| Doubly Robust Policy Evaluation and Optimization | Mar 10, 2015 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Second-order Quantile Methods for Experts and Combinatorial Games | Feb 27, 2015 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Fairness in Multi-Agent Sequential Decision-Making | Dec 1, 2014 | Decision Making, Fairness | Unverified | 0 |
| Active Sensing as Bayes-Optimal Sequential Decision Making | Aug 9, 2014 | Decision Making, Sensitivity | Unverified | 0 |
| Chasing Ghosts: Competing with Stateful Policies | Jul 29, 2014 | Attribute, Decision Making | Unverified | 0 |
| Algorithms for CVaR Optimization in MDPs | Jun 12, 2014 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces | May 26, 2014 | Decision Making, reinforcement-learning | Unverified | 0 |
| Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs | Mar 25, 2014 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Survey of Multi-Objective Sequential Decision-Making | Feb 4, 2014 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams | Jan 18, 2014 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Non-Deterministic Policies in Markovian Decision Processes | Jan 16, 2014 | Decision Making, reinforcement-learning | Unverified | 0 |
| Online Planning Algorithms for POMDPs | Jan 15, 2014 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Actor-Critic Algorithms for Risk-Sensitive MDPs | Dec 1, 2013 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Variational Planning for Graph-based MDPs | Dec 1, 2013 | Decision Making, Sequential Decision Making | Unverified | 0 |