| Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent | Jun 2, 2020 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| On the Global Convergence Rates of Softmax Policy Gradient Methods | May 13, 2020 | Open-Ended Question AnsweringPolicy Gradient Methods | —Unverified | 0 |
| Improving Sample Efficiency and Multi-Agent Communication in RL-based Train Rescheduling | Apr 28, 2020 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality? | Apr 2, 2020 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Exchangeable Input Representations for Reinforcement Learning | Mar 19, 2020 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Stochastic Recursive Momentum for Policy Gradient Methods | Mar 9, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Policy-Aware Model Learning for Policy Gradient Methods | Feb 28, 2020 | modelModel-based Reinforcement Learning | CodeCode Available | 0 |
| GACEM: Generalized Autoregressive Cross Entropy Method for Multi-Modal Black Box Constraint Satisfaction | Feb 17, 2020 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning | Feb 12, 2020 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 |
| Statistically Efficient Off-Policy Policy Gradients | Feb 10, 2020 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts | Feb 7, 2020 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks | Jan 31, 2020 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning based Blind mmWave MIMO Beam Alignment | Jan 25, 2020 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| A Nonparametric Off-Policy Policy Gradient | Jan 8, 2020 | Density EstimationPolicy Gradient Methods | CodeCode Available | 0 |
| Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods | Dec 11, 2019 | Policy Gradient Methods | —Unverified | 0 |
| Fast Efficient Hyperparameter Tuning for Policy Gradient Methods | Dec 1, 2019 | Policy Gradient Methods | CodeCode Available | 0 |
| Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient | Oct 25, 2019 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Policy Optimization for H_2 Linear Control with H_ Robustness Guarantee: Implicit Regularization and Global Convergence | Oct 21, 2019 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| All-Action Policy Gradient Methods: A Numerical Integration Approach | Oct 21, 2019 | Allcontinuous-control | —Unverified | 0 |
| Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods | Oct 9, 2019 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control | Sep 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Guided Adaptive Credit Assignment for Sample Efficient Policy Optimization | Sep 25, 2019 | Instruction FollowingPolicy Gradient Methods | —Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| AUGMENTED POLICY GRADIENT METHODS FOR EFFICIENT REINFORCEMENT LEARNING | Sep 25, 2019 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Sample Efficient Policy Gradient Methods with Recursive Variance Reduction | Sep 18, 2019 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 0 |