SOTAVerified

Policy Gradient Methods

Papers

Showing 151200 of 382 papers

TitleStatusHype
Statistically Efficient Off-Policy Policy Gradients0
Stein Variational Policy Gradient0
Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes0
Stochastic Dimension-reduced Second-order Methods for Policy Optimization0
Stochastic first-order methods for average-reward Markov decision processes0
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies0
Stochastic Recursive Momentum for Policy Gradient Methods0
Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function0
Stochastic Variance Reduction for Policy Gradient Estimation0
Strategic bidding in freight transport using deep reinforcement learning0
Strongly-polynomial time and validation analysis of policy gradient methods0
Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence0
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning0
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods0
The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions0
Token-Efficient RL for LLM Reasoning0
Towards Adapting Reinforcement Learning Agents to New Tasks: Insights from Q-Values0
Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis0
Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework0
Towards Provable Log Density Policy Gradient0
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning0
Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods0
Transfer Reward Learning for Policy Gradient-Based Text Generation0
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach0
Policy Gradient in Partially Observable Environments: Approximation and Convergence0
Understanding Early Word Learning in Situated Artificial Agents0
Understanding Grounded Language Learning Agents0
Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings0
Variance Reduced Domain Randomization for Policy Gradient0
Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization0
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines0
Variance Reduction for Reinforcement Learning in Input-Driven Environments0
Variance Reduction in Actor Critic Methods (ACM)0
When Do Off-Policy and On-Policy Policy Gradient Methods Align?0
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies0
Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization0
Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning0
Federated Reinforcement Learning with Constraint Heterogeneity0
Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control0
Fine-Grained AutoAugmentation for Multi-Label Classification0
Fingerprint Policy Optimisation for Robust Reinforcement Learning0
Focused Hierarchical RNNs for Conditional Sequence Processing0
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences0
From Imitation to Refinement -- Residual RL for Precise Assembly0
GACEM: Generalized Autoregressive Cross Entropy Method for Multi-Modal Black Box Constraint Satisfaction0
Geometry and convergence of natural policy gradient methods0
Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries0
Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction0
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator0
Global Convergence of Policy Gradient Methods for Linearized Control Problems0
Show:102550
← PrevPage 4 of 8Next →

No leaderboard results yet.