SOTAVerified

Policy Gradient Methods

Papers

Showing 101125 of 382 papers

TitleStatusHype
Curious Explorer: a provable exploration strategy in Policy Learning0
Entropy annealing for policy mirror descent in continuous time and space0
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods0
A reinterpretation of the policy oscillation phenomenon in approximate policy iteration0
Equivalence Between Policy Gradients and Soft Q-Learning0
Equivalence of stochastic and deterministic policy gradients0
Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes0
Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization0
Evolutionary Policy Optimization0
Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator0
Adversarial Policy Gradient for Alternating Markov Games0
Exchangeable Input Representations for Reinforcement Learning0
Expected Policy Gradients for Reinforcement Learning0
Countering Language Drift via Grounding0
Correcting discount-factor mismatch in on-policy policy gradient methods0
Approximation Benefits of Policy Gradient Methods with Aggregated States0
Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs0
Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization0
Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning0
Federated Reinforcement Learning with Constraint Heterogeneity0
Momentum-Based Policy Gradient with Second-Order Information0
Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control0
Fine-Grained AutoAugmentation for Multi-Label Classification0
Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems0
A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee0
Show:102550
← PrevPage 5 of 16Next →

No leaderboard results yet.