| Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning | Sep 20, 2022 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| On the Optimization Landscape of Dynamic Output Feedback: A Case Study for Linear Quadratic Regulator | Sep 12, 2022 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| The Performance Impact of Combining Agent Factorization with Different Learning Algorithms for Multiagent Coordination | Sep 9, 2022 | ManagementPolicy Gradient Methods | CodeCode Available | 0 |
| Natural Policy Gradients In Reinforcement Learning Explained | Sep 5, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Towards Global Optimality in Cooperative MARL with the Transformation And Distillation Framework | Jul 12, 2022 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Convergence and Price of Anarchy Guarantees of the Softmax Policy Gradient in Markov Potential Games | Jun 15, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization | Jun 14, 2022 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| How are policy gradient methods affected by the limits of control? | Jun 14, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Learning Dynamics and Generalization in Reinforcement Learning | Jun 5, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function | May 25, 2022 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |