SOTAVerified

Policy Gradient Methods

Papers

Showing 51-75 of 382 papers

Title | Status | Hype
Neural Logic Reinforcement Learning | Code | 0
Policy Gradient for Robust Markov Decision Processes | Code | 0
Momentum-Based Policy Gradient Methods | Code | 0
Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning | Code | 0
Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient | Code | 0
Action-depedent Control Variates for Policy Optimization via Stein's Identity | Code | 0
Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment | Code | 0
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement | Code | 0
Clipped Action Policy Gradient | Code | 0
Clipped-Objective Policy Gradients for Pessimistic Policy Optimization | Code | 0
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Code | 0
Oracle Complexity Reduction for Model-free LQR: A Stochastic Variance-Reduced Policy Gradient Approach | Code | 0
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Code | 0
Neural Replicator Dynamics | Code | 0
Hindsight policy gradients | Code | 0
Hierarchical Policy-Gradient Reinforcement Learning for Multi-Agent Shepherding Control of Non-Cohesive Targets | Code | 0
A general class of surrogate functions for stable and efficient reinforcement learning | Code | 0
High-Dimensional Continuous Control Using Generalized Advantage Estimation | Code | 0
Hindsight Trust Region Policy Optimization | Code | 0
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models | Code | 0
Fast Efficient Hyperparameter Tuning for Policy Gradient Methods | Code | 0
Evaluating Rewards for Question Generation Models | Code | 0
Convergence Guarantees of Model-free Policy Gradient Methods for LQR with Stochastic Data | Code | 0
Fast Efficient Hyperparameter Tuning for Policy Gradients | Code | 0
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch | Code | 0

No leaderboard results yet.