| Learning Self-Imitating Diverse Policies | May 25, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration | Jul 30, 2018 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator | Jan 15, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Natural Policy Gradient Methods with Parameter-based Exploration for Control Tasks | Dec 1, 2010 | Policy Gradient Methods | —Unverified | 0 |
| Lifelong Learning of Factored Policies via Policy Gradients | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | Jan 2, 2024 | MuJoCoPolicy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Distortion Risk Measures | Jul 9, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Linear convergence of a policy gradient method for some finite horizon continuous time control problems | Mar 22, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries | Mar 15, 2024 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Computing and Learning Stationary Mean Field Equilibria with Scalar Interactions: Algorithms and Applications | Feb 2, 2025 | counterfactualPolicy Gradient Methods | —Unverified | 0 |