| Adaptive Step-Size for Policy Gradient Methods | Dec 1, 2013 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result | Dec 1, 2013 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| A reinterpretation of the policy oscillation phenomenon in approximate policy iteration | Dec 1, 2011 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Analysis and Improvement of Policy Gradient Estimation | Dec 1, 2011 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Natural Policy Gradient Methods with Parameter-based Exploration for Control Tasks | Dec 1, 2010 | Policy Gradient Methods | —Unverified | 0 |
| On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient | Dec 1, 2010 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Policy Search for Motor Primitives in Robotics | Dec 1, 2008 | Imitation LearningPolicy Gradient Methods | —Unverified | 0 |