| Natural Policy Gradient Methods with Parameter-based Exploration for Control Tasks | Dec 1, 2010 | Policy Gradient Methods | —Unverified | 0 |
| Natural Policy Gradients In Reinforcement Learning Explained | Sep 5, 2022 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks | Jan 31, 2020 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 |
| Neural Policy Gradient Methods: Global Optimality and Rates of Convergence | Aug 29, 2019 | Policy Gradient Methods | —Unverified | 0 |
| Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment | Mar 24, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Object Exchangeability in Reinforcement Learning: Extended Abstract | May 7, 2019 | Deep Reinforcement Learning, Object | —Unverified | 0 |
| Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline | May 4, 2024 | Computational Efficiency, MuJoCo | —Unverified | 0 |
| On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient | Dec 1, 2010 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 |
| On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment | May 29, 2025 | Federated Learning, Policy Gradient Methods | —Unverified | 0 |
| On the Convergence of Discounted Policy Gradient Methods | Dec 28, 2022 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| On the convergence of policy gradient methods to Nash equilibria in general stochastic games | Oct 17, 2022 | Policy Gradient Methods | —Unverified | 0 |
| On the Convergence Rates of Policy Gradient Methods | Jan 19, 2022 | Policy Gradient Methods | —Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision Making, Policy Gradient Methods | —Unverified | 0 |
| On the Global Convergence Rates of Softmax Policy Gradient Methods | May 13, 2020 | Open-Ended Question Answering, Policy Gradient Methods | —Unverified | 0 |
| On the Linear convergence of Natural Policy Gradient Algorithm | May 4, 2021 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator | Sep 12, 2022 | Decision Making, Policy Gradient Methods | —Unverified | 0 |
| On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift | Aug 1, 2019 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 |
| Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient | Oct 25, 2019 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 |
| Acceleration in Policy Optimization | Jun 18, 2023 | Meta-Learning, Policy Gradient Methods | —Unverified | 0 |
| Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property | Dec 19, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result | Dec 1, 2013 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 |
| Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback | Oct 29, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Optimizing over a Restricted Policy Class in Markov Decision Processes | Feb 26, 2018 | Policy Gradient Methods | —Unverified | 0 |
| Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods | Oct 8, 2023 | Policy Gradient Methods, Traveling Salesman Problem | —Unverified | 0 |
| Ordering-based Conditions for Global Convergence of Policy Gradient Methods | Apr 2, 2025 | Policy Gradient Methods | —Unverified | 0 |