| Adaptive Step-Size for Policy Gradient Methods | Dec 1, 2013 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Identifying Policy Gradient Subspaces | Jan 12, 2024 | Continuous Control | Unverified | 0 |
| Image Captioning based on Deep Reinforcement Learning | Sep 13, 2018 | Deep Reinforcement Learning, Image Captioning | Unverified | 0 |
| Improvements on Hindsight Learning | Sep 16, 2018 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Global Convergence of Policy Gradient Methods for Linearized Control Problems | Jan 1, 2018 | Continuous Control | Unverified | 0 |
| Improving DAPO from a Mixed-Policy Perspective | Jul 17, 2025 | Policy Gradient Methods | Unverified | 0 |
| DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning | Sep 18, 2019 | Deep Reinforcement Learning, Motion Planning | Unverified | 0 |
| Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions | Jun 16, 2024 | Multi-Armed Bandits, Policy Gradient Methods | Unverified | 0 |
| Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator | Jan 15, 2018 | Continuous Control | Unverified | 0 |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | Jan 2, 2024 | MuJoCo, Policy Gradient Methods | Unverified | 0 |