| Safe Policy Improvement by Minimizing Robust Baseline Regret | Jul 13, 2016 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Preference at First Sight | Jun 24, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Model-Free Episodic Control | Jun 14, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| The Bayesian Linear Information Filtering Problem | May 30, 2016 | ArticlesDecision Making | —Unverified | 0 |
| Deep Action Sequence Learning for Causal Shape Transformation | May 17, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Real-Time Web Scale Event Summarization Using Sequential Decision Making | May 12, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Stochastic Contextual Bandits with Known Reward Functions | Apr 30, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games | Apr 24, 2016 | Atari GamesDecision Making | —Unverified | 0 |
| Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains | Mar 15, 2016 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| PAC Reinforcement Learning with Rich Observations | Feb 8, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |