
Policy Gradient Methods

Papers

Showing 151-160 of 382 papers

| Title | Status | Hype |
| --- | --- | --- |
| Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function | | 0 |
| The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure | Code | 1 |
| Momentum-Based Policy Gradient with Second-Order Information | | 0 |
| Stochastic first-order methods for average-reward Markov decision processes | | 0 |
| Learning to Constrain Policy Optimization with Virtual Trust Region | | 0 |
| Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization | | 0 |
| Synthesis of Stabilizing Recurrent Equilibrium Network Controllers | Code | 0 |
| Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional Reasoning Approach | | 0 |
| Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment | | 0 |
| Linear convergence of a policy gradient method for some finite horizon continuous time control problems | | 0 |

No leaderboard results yet.