| Policy Learning and Evaluation with Randomized Quasi-Monte Carlo | Feb 16, 2022 | Continuous Control | Unverified | 0 |
| Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence | Feb 8, 2022 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation | Feb 1, 2022 | Policy Gradient Methods | Unverified | 0 |
| Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods | Jan 28, 2022 | Knowledge Graphs, Policy Gradient Methods | Code Available | 0 |
| Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity | Jan 24, 2022 | Policy Gradient Methods | Unverified | 0 |
| Understanding the Effects of Second-Order Approximations in Natural Policy Gradient Reinforcement Learning | Jan 22, 2022 | Policy Gradient Methods, reinforcement-learning | Code Available | 0 |
| On the Convergence Rates of Policy Gradient Methods | Jan 19, 2022 | Policy Gradient Methods | Unverified | 0 |
| Reinforcement Learning based Sequential Batch-sampling for Bayesian Optimal Experimental Design | Dec 21, 2021 | Deep Reinforcement Learning, Experimental Design | Unverified | 0 |
| MDPGT: Momentum-based Decentralized Policy Gradient Tracking | Dec 6, 2021 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Episodic Policy Gradient Training | Dec 3, 2021 | Policy Gradient Methods, Scheduling | Code Available | 1 |