| Natural Policy Gradient Methods with Parameter-based Exploration for Control Tasks | Dec 1, 2010 | Policy Gradient Methods | —Unverified | 0 |
| Natural Policy Gradients In Reinforcement Learning Explained | Sep 5, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks | Jan 31, 2020 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Neural Policy Gradient Methods: Global Optimality and Rates of Convergence | Aug 29, 2019 | Policy Gradient Methods | —Unverified | 0 |
| Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment | Mar 24, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Object Exchangeability in Reinforcement Learning: Extended Abstract | May 7, 2019 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline | May 4, 2024 | Computational EfficiencyMuJoCo | —Unverified | 0 |
| On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient | Dec 1, 2010 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment | May 29, 2025 | Federated LearningPolicy Gradient Methods | —Unverified | 0 |
| On the Convergence of Discounted Policy Gradient Methods | Dec 28, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |