SOTAVerified

Policy Gradient Methods

Papers

Showing 221230 of 382 papers

TitleStatusHype
Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods0
Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence0
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon0
Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods0
A Study of Policy Gradient on a Class of Exactly Solvable Models0
Experimental design for MRI by greedy policy searchCode1
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient0
Sample Efficient Reinforcement Learning with REINFORCE0
Rethinking Deep Policy Gradients via State-Wise Policy Improvement0
Efficient Wasserstein Natural Gradients for Reinforcement LearningCode1
Show:102550
← PrevPage 23 of 39Next →

No leaderboard results yet.