| An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods | Nov 15, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Geometry and convergence of natural policy gradient methods | Nov 3, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems | Nov 1, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence | Oct 23, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Designing Dynamic Output Feedback Controllers | Oct 18, 2022 | Policy Gradient Methods | —Unverified | 0 |
| On the convergence of policy gradient methods to Nash equilibria in general stochastic games | Oct 17, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies | Oct 4, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Oct 3, 2022 | Decision MakingPolicy Gradient Methods | CodeCode Available | 1 |
| SoftTreeMax: Policy Gradient with Tree Search | Sep 28, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning | Sep 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |