| Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result | Dec 1, 2013 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback | Oct 29, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Optimizing over a Restricted Policy Class in Markov Decision Processes | Feb 26, 2018 | Policy Gradient Methods | —Unverified | 0 |
| Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods | Oct 8, 2023 | Policy Gradient MethodsTraveling Salesman Problem | —Unverified | 0 |
| Ordering-based Conditions for Global Convergence of Policy Gradient Methods | Apr 2, 2025 | Policy Gradient Methods | —Unverified | 0 |
| PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation | Feb 1, 2022 | Policy Gradient Methods | —Unverified | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods | Jul 18, 2024 | Atari GamesDecision Making | —Unverified | 0 |
| Policy Gradient for Coherent Risk Measures | Feb 13, 2015 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Policy Gradient for Rectangular Robust Markov Decision Processes | Jan 31, 2023 | FormPolicy Gradient Methods | —Unverified | 0 |