| Deep Reinforcement Learning for Visual Object Tracking in Videos | Jan 31, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Model-Free Control of Thermostatically Controlled Loads Connected to a District Heating Network | Jan 27, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 |
| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active LearningDecision Making | —Unverified | 0 |
| Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making | Jan 5, 2017 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Stochastic Planning and Lifted Inference | Jan 4, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| From Preference-Based to Multiobjective Sequential Decision-Making | Jan 3, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Multi-armed Bandits: Competing with Optimal Sequences | Dec 1, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Fast Video Classification via Adaptive Cascading of Deep Models | Nov 20, 2016 | ClassificationCPU | —Unverified | 0 |
| Open Problem: Approximate Planning of POMDPs in the class of Memoryless Policies | Aug 17, 2016 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Human collective intelligence as distributed Bayesian inference | Aug 5, 2016 | Bayesian InferenceDecision Making | —Unverified | 0 |
| Safe Policy Improvement by Minimizing Robust Baseline Regret | Jul 13, 2016 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Preference at First Sight | Jun 24, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Model-Free Episodic Control | Jun 14, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| The Bayesian Linear Information Filtering Problem | May 30, 2016 | ArticlesDecision Making | —Unverified | 0 |
| Deep Action Sequence Learning for Causal Shape Transformation | May 17, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Real-Time Web Scale Event Summarization Using Sequential Decision Making | May 12, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Stochastic Contextual Bandits with Known Reward Functions | Apr 30, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games | Apr 24, 2016 | Atari GamesDecision Making | —Unverified | 0 |
| Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains | Mar 15, 2016 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| PAC Reinforcement Learning with Rich Observations | Feb 8, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| End-to-End Goal-Driven Web Navigation | Feb 6, 2016 | Decision MakingQuestion Answering | CodeCode Available | 0 |
| Risk-Constrained Reinforcement Learning with Percentile Risk Criteria | Dec 5, 2015 | Decision MakingMarketing | —Unverified | 0 |
| Reuse of Neural Modules for General Video Game Playing | Dec 4, 2015 | Atari GamesDecision Making | —Unverified | 0 |
| Bandits with Unobserved Confounders: A Causal Approach | Dec 1, 2015 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version) | Nov 29, 2015 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |