| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Action-depedent Control Variates for Policy Optimization via Stein's Identity | Oct 30, 2017 | Policy Gradient Methods, reinforcement-learning | Code Available | 0 |
| Understanding Early Word Learning in Situated Artificial Agents | Oct 26, 2017 | Grounded language learning, Policy Gradient Methods | Unverified | 0 |
| Accelerated Reinforcement Learning | Oct 23, 2017 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| Stochastic Variance Reduction for Policy Gradient Estimation | Oct 17, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Manifold Regularization for Kernelized LSTD | Oct 15, 2017 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Cold-Start Reinforcement Learning with Softmax Policy Gradient | Sep 27, 2017 | Image Captioning, Policy Gradient Methods | Code Available | 0 |
| Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control | Aug 10, 2017 | continuous-control, Continuous Control | Code Available | 0 |
| Proximal Policy Optimization Algorithms | Jul 20, 2017 | Continuous Control, Dota 2 | Code Available | 2 |
| Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines | Jun 20, 2017 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |