SOTAVerified

Policy Gradient Methods

Papers

Showing 276300 of 382 papers

TitleStatusHype
DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning0
Deep Reinforcement Learning Algorithm for Dynamic Pricing of Express Lanes with Multiple Access LocationsCode0
Transfer Reward Learning for Policy Gradient-Based Text Generation0
Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles0
Neural Policy Gradient Methods: Global Optimality and Rates of Convergence0
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods0
Health-Informed Policy Gradients for Multi-Agent Reinforcement LearningCode0
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift0
Hindsight Trust Region Policy OptimizationCode0
Variance Reduction in Actor Critic Methods (ACM)0
Shapley Q-value: A Local Reward Approach to Solve Global Reward GamesCode0
Policy Optimization with Stochastic Mirror Descent0
Ranking Policy GradientCode0
Entropic Risk Measure in Policy Search0
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies0
Is the Policy Gradient a Gradient?0
A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression0
Global Optimality Guarantees For Policy Gradient Methods0
Neural Replicator DynamicsCode0
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies0
Policy Search by Target Distribution Learning for Continuous Control0
Trajectory-Based Off-Policy Deep Reinforcement LearningCode0
Learning Novel Policies For Tasks0
Object Exchangeability in Reinforcement Learning: Extended Abstract0
Neural Logic Reinforcement LearningCode0
Show:102550
← PrevPage 12 of 16Next →

No leaderboard results yet.