| Predicting Multiple Actions for Stochastic Continuous Control | Jan 1, 2018 | continuous-controlContinuous Control | —Unverified | 0 |
| Adversarial Policy Gradient for Alternating Markov Games | Jan 1, 2018 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Action-dependent Control Variates for Policy Optimization via Stein Identity | Jan 1, 2018 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Understanding Grounded Language Learning Agents | Jan 1, 2018 | Grounded language learningPolicy Gradient Methods | —Unverified | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Bayesian Policy Gradients via Alpha Divergence Dropout Inference | Dec 6, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Adaptive Batch Size for Safe Policy Gradients | Dec 1, 2017 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Divide-and-Conquer Reinforcement Learning | Nov 27, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Run, skeleton, run: skeletal model in a physics-based simulation | Nov 18, 2017 | NavigatePolicy Gradient Methods | CodeCode Available | 0 |
| Hindsight policy gradients | Nov 16, 2017 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 0 |