SOTAVerified

Policy Gradient Methods

Papers

Showing 201250 of 382 papers

TitleStatusHype
Evolution Strategies as an Alternate Learning method for Hierarchical Reinforcement Learning0
Asynchronous Multi-Agent Actor-Critic with Macro-Actions0
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods0
A general class of surrogate functions for stable and efficient reinforcement learningCode0
Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings0
Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games0
Hindsight Value Function for Variance Reduction in Stochastic Dynamic EnvironmentCode0
Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information0
Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences0
Fine-Grained AutoAugmentation for Multi-Label Classification0
Policy Gradient Methods for Distortion Risk Measures0
Curious Explorer: a provable exploration strategy in Policy Learning0
Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment0
End-to-End Neuro-Symbolic Architecture for Image-to-Image Reasoning Tasks0
Ad Headline Generation using Self-Critical Masked Language Model0
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning0
Meta Learning the Step Size in Policy Gradient Methods0
Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial0
On the Linear convergence of Natural Policy Gradient Algorithm0
Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients0
Softmax Policy Gradient Methods Can Take Exponential Time to Converge0
Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs0
Strategic bidding in freight transport using deep reinforcement learning0
Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games0
Independent Policy Gradient Methods for Competitive Reinforcement Learning0
PGPS : Coupling Policy Gradient with Population-based Search0
Incremental Policy Gradients for Online Reinforcement Learning Control0
Self-Supervised Continuous Control without Policy Gradient0
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition0
Difference Rewards Policy Gradients0
Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FELCode0
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points0
Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods0
Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence0
Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon0
Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods0
A Study of Policy Gradient on a Class of Exactly Solvable Models0
Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient0
Sample Efficient Reinforcement Learning with REINFORCE0
Rethinking Deep Policy Gradients via State-Wise Policy Improvement0
Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator0
Approximation Benefits of Policy Gradient Methods with Aggregated States0
On Linear Convergence of Policy Gradient Methods for Finite MDPs0
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient LearningCode0
Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization0
Momentum-Based Policy Gradient MethodsCode0
Policy Gradient Optimization of Thompson Sampling Policies0
An operator view of policy gradient methods0
Lifelong Learning of Factored Policies via Policy Gradients0
Zeroth-Order Supervised Policy Improvement0
Show:102550
← PrevPage 5 of 8Next →

No leaderboard results yet.