| Policy-Aware Model Learning for Policy Gradient Methods | Feb 28, 2020 | modelModel-based Reinforcement Learning | CodeCode Available | 0 |
| Multilinear Tensor Low-Rank Approximation for Policy-Gradient Methods in Reinforcement Learning | Jan 8, 2025 | Policy Gradient MethodsReinforcement Learning (RL) | CodeCode Available | 0 |
| The Performance Impact of Combining Agent Factorization with Different Learning Algorithms for Multiagent Coordination | Sep 9, 2022 | ManagementPolicy Gradient Methods | CodeCode Available | 0 |
| Policy Gradient for Robust Markov Decision Processes | Oct 29, 2024 | Policy Gradient Methods | CodeCode Available | 0 |
| V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control | Sep 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form | Aug 29, 2024 | FormPolicy Gradient Methods | CodeCode Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Convergence Guarantees of Model-free Policy Gradient Methods for LQR with Stochastic Data | Feb 27, 2025 | Policy Gradient Methods | CodeCode Available | 0 |
| Neural Logic Reinforcement Learning | Apr 24, 2019 | Deep Reinforcement LearningInductive logic programming | CodeCode Available | 0 |
| On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning | Feb 12, 2020 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 |