| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods | Jul 18, 2024 | Atari Games, Decision Making | —Unverified | 0 | 0 |
| Policy Gradient for Coherent Risk Measures | Feb 13, 2015 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Gradient for Rectangular Robust Markov Decision Processes | Jan 31, 2023 | Form, Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy gradient learning methods for stochastic control with exit time and applications to share repurchase pricing | Feb 14, 2023 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games | Jul 27, 2021 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Gradient Methods for Designing Dynamic Output Feedback Controllers | Oct 18, 2022 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Gradient Methods for Discrete Time Linear Quadratic Regulator With Random Parameters | Mar 29, 2023 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Policy Gradient Methods for Off-policy Control | Dec 13, 2015 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines | Jun 20, 2017 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence | May 23, 2024 | Distributional Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon | Nov 20, 2020 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Gradient Optimization of Thompson Sampling Policies | Jun 30, 2020 | Policy Gradient Methods, Thompson Sampling | —Unverified | 0 | 0 |
| Policy Gradients for Contextual Recommendations | Feb 12, 2018 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| Policy Learning and Evaluation with Randomized Quasi-Monte Carlo | Feb 16, 2022 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient Exploration, General Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement Learning, Imitation Learning | —Unverified | 0 | 0 |
| Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence | Nov 24, 2020 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Optimization for $H_2$ Linear Control with $H_\infty$ Robustness Guarantee: Implicit Regularization and Global Convergence | Oct 21, 2019 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Optimization with Demonstrations | Jul 1, 2018 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Optimization with Stochastic Mirror Descent | Jun 25, 2019 | Continuous Control, Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Policy Search for Motor Primitives in Robotics | Dec 1, 2008 | Imitation Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Testing in Markov Decision Processes | May 21, 2025 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |