| Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts | Feb 7, 2020 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks | Jan 31, 2020 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning based Blind mmWave MIMO Beam Alignment | Jan 25, 2020 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| A Nonparametric Off-Policy Policy Gradient | Jan 8, 2020 | Density Estimation, Policy Gradient Methods | Code Available | 0 |
| Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods | Dec 11, 2019 | Policy Gradient Methods | Unverified | 0 |
| Fast Efficient Hyperparameter Tuning for Policy Gradient Methods | Dec 1, 2019 | Policy Gradient Methods | Code Available | 0 |
| Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient | Oct 25, 2019 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| Policy Optimization for H_2 Linear Control with H_∞ Robustness Guarantee: Implicit Regularization and Global Convergence | Oct 21, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| All-Action Policy Gradient Methods: A Numerical Integration Approach | Oct 21, 2019 | All, continuous-control | Unverified | 0 |
| Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods | Oct 9, 2019 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |