| Learning Self-Imitating Diverse Policies | May 25, 2018 | Continuous Control | —Unverified | 0 |
| Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration | Jul 30, 2018 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 |
| Lifelong Learning of Factored Policies via Policy Gradients | Jun 12, 2020 | Continuous Control | —Unverified | 0 |
| Policy Gradient Methods for Distortion Risk Measures | Jul 9, 2021 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Linear convergence of a policy gradient method for some finite horizon continuous time control problems | Mar 22, 2022 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies | Oct 4, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges | May 27, 2024 | Acrobot, Policy Gradient Methods | —Unverified | 0 |
| Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods | Oct 9, 2019 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning | Oct 16, 2021 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Local Pairwise Distance Matching for Backpropagation-Free Reinforcement Learning | Jul 15, 2025 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |