| DART: Noise Injection for Robust Imitation Learning | Mar 27, 2017 | Imitation Learning, MuJoCo | Code Available | 1 |
| Evolution Strategies as a Scalable Alternative to Reinforcement Learning | Mar 10, 2017 | Atari Games, MuJoCo | Code Available | 1 |
| Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning | Feb 20, 2017 | Car Racing, Decision Making | —Unverified | 0 |
| A K-fold Method for Baseline Estimation in Policy Gradient Algorithms | Jan 3, 2017 | MuJoCo, Policy Gradient Methods | —Unverified | 0 |
| Model-based Adversarial Imitation Learning | Dec 7, 2016 | Imitation Learning, model | —Unverified | 0 |
| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-control, Continuous Control | Code Available | 0 |
| MuJoCo: A physics engine for model-based control | Oct 7, 2012 | model, MuJoCo | Code Available | 0 |