| Variance Reduction for Reinforcement Learning in Input-Driven Environments | Jul 6, 2018 | Meta-Learning, MuJoCo | Unverified | 0 |
| Self-Imitation Learning | Jun 14, 2018 | Atari Games, Imitation Learning | Code Available | 0 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning Self-Imitating Diverse Policies | May 25, 2018 | Continuous Control | Unverified | 0 |
| Policy Optimization with Second-Order Advantage Information | May 9, 2018 | Continuous Control | Code Available | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari Games, Decision Making | Code Available | 0 |
| Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari | Feb 24, 2018 | Atari Games, Benchmarking | Code Available | 0 |
| Structured Control Nets for Deep Reinforcement Learning | Feb 22, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients | Jan 17, 2018 | MuJoCo, Sensitivity | Unverified | 0 |
| Combination of Supervised and Reinforcement Learning For Vision-Based Autonomous Control | Jan 1, 2018 | MuJoCo, Reinforcement Learning | Unverified | 0 |
| Combining Model-based and Model-free RL via Multi-step Control Variates | Jan 1, 2018 | Continuous Control | Unverified | 0 |
| NerveNet: Learning Structured Policy with Graph Neural Networks | Jan 1, 2018 | Benchmarking, Continuous Control | Code Available | 0 |
| Bayesian Policy Gradients via Alpha Divergence Dropout Inference | Dec 6, 2017 | Continuous Control | Code Available | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Stochastic Variance Reduction for Policy Gradient Estimation | Oct 17, 2017 | Continuous Control | Unverified | 0 |
| A novel DDPG method with prioritized experience replay | Oct 1, 2017 | Continuous Control | Code Available | 0 |
| Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks | Sep 20, 2017 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| DropoutDAgger: A Bayesian Approach to Safe Imitation Learning | Sep 18, 2017 | Imitation Learning, MuJoCo | Unverified | 0 |
| Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning | Aug 8, 2017 | Deep Reinforcement Learning, model | Code Available | 0 |
| The Intentional Unintentional Agent: Learning to Solve Many Continuous Control Tasks Simultaneously | Jul 11, 2017 | Continuous Control | Unverified | 0 |
| Robust Imitation of Diverse Behaviors | Jul 10, 2017 | Imitation Learning, MuJoCo | Unverified | 0 |
| Expected Policy Gradients | Jun 15, 2017 | MuJoCo, Reinforcement Learning | Unverified | 0 |
| Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning | Feb 20, 2017 | Car Racing, Decision Making | Unverified | 0 |
| A K-fold Method for Baseline Estimation in Policy Gradient Algorithms | Jan 3, 2017 | MuJoCo, Policy Gradient Methods | Unverified | 0 |
| Model-based Adversarial Imitation Learning | Dec 7, 2016 | Imitation Learning, model | Unverified | 0 |