SOTAVerified

Policy Gradient Methods

Papers

Showing 351375 of 382 papers

TitleStatusHype
Divide-and-Conquer Reinforcement LearningCode0
Run, skeleton, run: skeletal model in a physics-based simulationCode0
Hindsight policy gradientsCode0
Policy Optimization by Genetic Distillation0
Action-depedent Control Variates for Policy Optimization via Stein's IdentityCode0
Understanding Early Word Learning in Situated Artificial Agents0
Accelerated Reinforcement Learning0
Stochastic Variance Reduction for Policy Gradient Estimation0
Manifold Regularization for Kernelized LSTD0
Cold-Start Reinforcement Learning with Softmax Policy GradientCode0
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous ControlCode0
Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines0
A unified view of entropy-regularized Markov decision processes0
Equivalence Between Policy Gradients and Soft Q-Learning0
Stein Variational Policy Gradient0
Batch Policy Gradient Methods for Improving Neural Conversation Models0
A K-fold Method for Baseline Estimation in Policy Gradient Algorithms0
Sample-efficient Deep Reinforcement Learning for Dialog Control0
Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy CriticCode0
Dual Learning for Machine TranslationCode0
Deep Reinforcement Learning for Dialogue GenerationCode0
Policy Gradient Methods for Off-policy Control0
High-Dimensional Continuous Control Using Generalized Advantage EstimationCode0
Policy Gradient for Coherent Risk Measures0
Efficient Baseline-free Sampling in Parameter Exploring Policy Gradients: Super Symmetric PGPE0
Show:102550
← PrevPage 15 of 16Next →

No leaderboard results yet.