| Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning | Apr 22, 2020 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Phasic Diversity Optimization for Population-Based Reinforcement Learning | Mar 17, 2024 | DiversityMuJoCo | —Unverified | 0 | 0 |
| Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLmodel | —Unverified | 0 | 0 |
| Policy Gradient with Kernel Quadrature | Oct 23, 2023 | Causal DiscoveryMuJoCo | —Unverified | 0 | 0 |
| Policy Gradient With Serial Markov Chain Reasoning | Oct 13, 2022 | Decision MakingMuJoCo | —Unverified | 0 | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |