| Robust Imitation of Diverse Behaviors | Jul 10, 2017 | Imitation LearningMuJoCo | —Unverified | 0 |
| Expected Policy Gradients | Jun 15, 2017 | MuJoCoReinforcement Learning | —Unverified | 0 |
| Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning | Feb 20, 2017 | Car RacingDecision Making | —Unverified | 0 |
| A K-fold Method for Baseline Estimation in Policy Gradient Algorithms | Jan 3, 2017 | MuJoCoPolicy Gradient Methods | —Unverified | 0 |
| Model-based Adversarial Imitation Learning | Dec 7, 2016 | Imitation Learningmodel | —Unverified | 0 |
| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-controlContinuous Control | CodeCode Available | 0 |
| MuJoCo: A physics engine for model-based control | Oct 7, 2012 | modelMuJoCo | CodeCode Available | 0 |