SOTAVerified

Policy Gradient Methods

Papers

Showing 231240 of 382 papers

TitleStatusHype
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FELCode0
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points0
Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods0
Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence0
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon0
Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods0
A Study of Policy Gradient on a Class of Exactly Solvable Models0
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient0
Sample Efficient Reinforcement Learning with REINFORCE0
Rethinking Deep Policy Gradients via State-Wise Policy Improvement0
Show:102550
← PrevPage 24 of 39Next →

No leaderboard results yet.