| Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning | Jul 3, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning | Jul 1, 2017 | Deep Reinforcement LearningGPU | CodeCode Available | 0 |
| Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management | Jul 1, 2017 | Deep Reinforcement LearningDialogue Management | —Unverified | 0 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Noisy Networks for Exploration | Jun 30, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Structure Learning in Motor Control:A Deep Reinforcement Learning Model | Jun 21, 2017 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Jun 19, 2017 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations | Jun 15, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning | Jun 15, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep reinforcement learning from human preferences | Jun 12, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments | Jun 7, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Parameter Space Noise for Exploration | Jun 6, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| UCB Exploration via Q-Ensembles | Jun 5, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning | Jun 1, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| Reinforcement Learning for Learning Rate Control | May 31, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning | May 30, 2017 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| End-to-end Active Object Tracking via Reinforcement Learning | May 30, 2017 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Cross-Domain Perceptual Reward Functions | May 25, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning | May 24, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach | May 23, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Enhanced Experience Replay Generation for Efficient Reinforcement Learning | May 23, 2017 | Deep Reinforcement LearningGenerative Adversarial Network | —Unverified | 0 |
| Thinking Fast and Slow with Deep Learning and Tree Search | May 23, 2017 | Decision MakingDeep Learning | CodeCode Available | 1 |
| Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning | May 21, 2017 | BenchmarkingDecision Making | —Unverified | 0 |
| Shallow Updates for Deep Reinforcement Learning | May 21, 2017 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning | May 20, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |