| Combining Model-based and Model-free RL via Multi-step Control Variates | Jan 1, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| NerveNet: Learning Structured Policy with Graph Neural Networks | Jan 1, 2018 | Benchmarkingcontinuous-control | CodeCode Available | 0 |
| Bayesian Policy Gradients via Alpha Divergence Dropout Inference | Dec 6, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Stochastic Variance Reduction for Policy Gradient Estimation | Oct 17, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| A novel DDPG method with prioritized experience replay | Oct 1, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks | Sep 20, 2017 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| DropoutDAgger: A Bayesian Approach to Safe Imitation Learning | Sep 18, 2017 | Imitation LearningMuJoCo | —Unverified | 0 |
| Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning | Aug 8, 2017 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously | Jul 11, 2017 | continuous-controlContinuous Control | —Unverified | 0 |