| Policy Gradient Methods for Off-policy Control | Dec 13, 2015 | Policy Gradient Methods | —Unverified | 0 |
| High-Dimensional Continuous Control Using Generalized Advantage Estimation | Jun 8, 2015 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Trust Region Policy Optimization | Feb 19, 2015 | Atari GamesPolicy Gradient Methods | CodeCode Available | 1 |
| Policy Gradient for Coherent Risk Measures | Feb 13, 2015 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Efficient Baseline-free Sampling in Parameter Exploring Policy Gradients: Super Symmetric PGPE | Dec 13, 2013 | Policy Gradient Methods | —Unverified | 0 |
| Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result | Dec 1, 2013 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Adaptive Step-Size for Policy Gradient Methods | Dec 1, 2013 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| A reinterpretation of the policy oscillation phenomenon in approximate policy iteration | Dec 1, 2011 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Analysis and Improvement of Policy Gradient Estimation | Dec 1, 2011 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient | Dec 1, 2010 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |