| Policy Learning and Evaluation with Randomized Quasi-Monte Carlo | Feb 16, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient ExplorationGeneral Reinforcement Learning | —Unverified | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence | Nov 24, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Policy Optimization for H_2 Linear Control with H_ Robustness Guarantee: Implicit Regularization and Global Convergence | Oct 21, 2019 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Policy Optimization with Demonstrations | Jul 1, 2018 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Policy Optimization with Stochastic Mirror Descent | Jun 25, 2019 | Continuous ControlPolicy Gradient Methods | —Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Search for Motor Primitives in Robotics | Dec 1, 2008 | Imitation LearningPolicy Gradient Methods | —Unverified | 0 |
| Policy Testing in Markov Decision Processes | May 21, 2025 | Policy Gradient Methods | —Unverified | 0 |