| On the Convergence of Discounted Policy Gradient Methods | Dec 28, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Policy Gradient in Robust MDPs with Global Convergence Guarantee | Dec 20, 2022 | Policy Gradient Methods | CodeCode Available | 0 |
| An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods | Nov 15, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Geometry and convergence of natural policy gradient methods | Nov 3, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems | Nov 1, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence | Oct 23, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Designing Dynamic Output Feedback Controllers | Oct 18, 2022 | Policy Gradient Methods | —Unverified | 0 |
| On the convergence of policy gradient methods to Nash equilibria in general stochastic games | Oct 17, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies | Oct 4, 2022 | Policy Gradient Methods | —Unverified | 0 |
| SoftTreeMax: Policy Gradient with Tree Search | Sep 28, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning | Sep 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator | Sep 12, 2022 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| The Performance Impact of Combining Agent Factorization with Different Learning Algorithms for Multiagent Coordination | Sep 9, 2022 | ManagementPolicy Gradient Methods | CodeCode Available | 0 |
| Natural Policy Gradients In Reinforcement Learning Explained | Sep 5, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework | Jul 12, 2022 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games | Jun 15, 2022 | Policy Gradient Methods | —Unverified | 0 |
| How are policy gradient methods affected by the limits of control? | Jun 14, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization | Jun 14, 2022 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Learning Dynamics and Generalization in Reinforcement Learning | Jun 5, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function | May 25, 2022 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Momentum-Based Policy Gradient with Second-Order Information | May 17, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Stochastic first-order methods for average-reward Markov decision processes | May 11, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Learning to Constrain Policy Optimization with Virtual Trust Region | Apr 20, 2022 | Atari GamesPolicy Gradient Methods | —Unverified | 0 |
| Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization | Apr 12, 2022 | Autonomous VehiclesPolicy Gradient Methods | —Unverified | 0 |
| Synthesis of Stabilizing Recurrent Equilibrium Network Controllers | Mar 31, 2022 | Policy Gradient Methods | CodeCode Available | 0 |
| Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional Reasoning Approach | Mar 29, 2022 | Hierarchical Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Non-Parametric Stochastic Policy Gradient with Strategic Retreat for Non-Stationary Environment | Mar 24, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Linear convergence of a policy gradient method for some finite horizon continuous time control problems | Mar 22, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Policy Learning and Evaluation with Randomized Quasi-Monte Carlo | Feb 16, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence | Feb 8, 2022 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation | Feb 1, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods | Jan 28, 2022 | Knowledge GraphsPolicy Gradient Methods | CodeCode Available | 0 |
| Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity | Jan 24, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning | Jan 22, 2022 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 0 |
| On the Convergence Rates of Policy Gradient Methods | Jan 19, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Reinforcement Learning based Sequential Batch-sampling for Bayesian Optimal Experimental Design | Dec 21, 2021 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 |
| MDPGT: Momentum-based Decentralized Policy Gradient Tracking | Dec 6, 2021 | Multi-agent Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Global Convergence Using Policy Gradient Methods for Model-free Markovian Jump Linear Quadratic Control | Nov 30, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Nov 6, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch | Nov 4, 2021 | Policy Gradient Methods | CodeCode Available | 0 |
| Proximal Policy Optimization with Continuous Bounded Action Space via the Beta Distribution | Nov 3, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings | Oct 30, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization | Oct 19, 2021 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Local Advantage Actor-Critic for Robust Multi-Agent Deep Reinforcement Learning | Oct 16, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Stabilizing Dynamical Systems via Policy Gradient Methods | Oct 13, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Programmatic Reinforcement Learning without Oracles | Sep 29, 2021 | Bilevel OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Variance Reduced Domain Randomization for Policy Gradient | Sep 29, 2021 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Efficient Wasserstein and Sinkhorn Policy Optimization | Sep 29, 2021 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Sample-efficient actor-critic algorithms with an etiquette for zero-sum Markov games | Sep 29, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game | Sep 29, 2021 | counterfactualDeep Reinforcement Learning | —Unverified | 0 |