| Stochastic first-order methods for average-reward Markov decision processes | May 11, 2022 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies | Feb 3, 2023 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Stochastic Recursive Momentum for Policy Gradient Methods | Mar 9, 2020 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function | May 25, 2022 | Policy Gradient Methods, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Stochastic Variance Reduction for Policy Gradient Estimation | Oct 17, 2017 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Strategic bidding in freight transport using deep reinforcement learning | Feb 18, 2021 | Deep Reinforcement Learning, Fairness | —Unverified | 0 | 0 |
| Strongly-polynomial time and validation analysis of policy gradient methods | Sep 28, 2024 | Policy Gradient Methods, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence | Oct 23, 2022 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning | May 31, 2021 | Learning Theory, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods | Sep 13, 2021 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |