SOTAVerified

Policy Gradient Methods

Papers

Showing 251300 of 382 papers

TitleStatusHype
Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles0
Natural Policy Gradient Methods with Parameter-based Exploration for Control Tasks0
Natural Policy Gradients In Reinforcement Learning Explained0
Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks0
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence0
Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment0
Object Exchangeability in Reinforcement Learning: Extended Abstract0
Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline0
On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient0
On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment0
On the Convergence of Discounted Policy Gradient Methods0
On the convergence of policy gradient methods to Nash equilibria in general stochastic games0
On the Convergence Rates of Policy Gradient Methods0
On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures0
On the Global Convergence Rates of Softmax Policy Gradient Methods0
On the Linear convergence of Natural Policy Gradient Algorithm0
On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator0
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift0
Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient0
Acceleration in Policy Optimization0
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property0
Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result0
Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback0
Optimizing over a Restricted Policy Class in Markov Decision Processes0
Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods0
Ordering-based Conditions for Global Convergence of Policy Gradient Methods0
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation0
PGPS : Coupling Policy Gradient with Population-based Search0
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods0
Policy Gradient for Coherent Risk Measures0
Policy Gradient for Rectangular Robust Markov Decision Processes0
Policy gradient learning methods for stochastic control with exit time and applications to share repurchase pricing0
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games0
Policy Gradient Methods for Designing Dynamic Output Feedback Controllers0
Policy Gradient Methods for Discrete Time Linear Quadratic Regulator With Random Parameters0
Policy Gradient Methods for Off-policy Control0
Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines0
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence0
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon0
Policy Gradient Optimization of Thompson Sampling Policies0
Policy Gradients for Contextual Recommendations0
Policy Learning and Evaluation with Randomized Quasi-Monte Carlo0
Policy Mirror Descent Inherently Explores Action Space0
Policy Optimization by Genetic Distillation0
Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence0
Policy Optimization for H_2 Linear Control with H_ Robustness Guarantee: Implicit Regularization and Global Convergence0
Policy Optimization with Demonstrations0
Policy Optimization with Stochastic Mirror Descent0
Policy Search by Target Distribution Learning for Continuous Control0
Policy Search for Motor Primitives in Robotics0
Show:102550
← PrevPage 6 of 8Next →

No leaderboard results yet.