| Policy gradient learning methods for stochastic control with exit time and applications to share repurchase pricing | Feb 14, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games | Jul 27, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Designing Dynamic Output Feedback Controllers | Oct 18, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Discrete Time Linear Quadratic Regulator With Random Parameters | Mar 29, 2023 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Policy Gradient Methods for Off-policy Control | Dec 13, 2015 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines | Jun 20, 2017 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence | May 23, 2024 | Distributional Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon | Nov 20, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Optimization of Thompson Sampling Policies | Jun 30, 2020 | Policy Gradient MethodsThompson Sampling | —Unverified | 0 |
| Policy Gradients for Contextual Recommendations | Feb 12, 2018 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |