| Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process | Mar 7, 2024 | Drug Design, Policy Gradient Methods | —Unverified | 0 |
| Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control | Mar 7, 2024 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 |
| Towards Provable Log Density Policy Gradient | Mar 3, 2024 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate | Mar 1, 2024 | Policy Gradient Methods | —Unverified | 0 |
| When Do Off-Policy and On-Policy Policy Gradient Methods Align? | Feb 19, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Identifying Policy Gradient Subspaces | Jan 12, 2024 | continuous-control, Continuous Control | —Unverified | 0 |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | Jan 2, 2024 | MuJoCo, Policy Gradient Methods | —Unverified | 0 |
| Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning | Jan 1, 2024 | Decision Making, Diversity | —Unverified | 0 |
| Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property | Dec 19, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains | Dec 9, 2023 | Multi-agent Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 |