| V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control | Sep 26, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Guided Adaptive Credit Assignment for Sample Efficient Policy Optimization | Sep 25, 2019 | Instruction Following, Policy Gradient Methods | Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 |
| AUGMENTED POLICY GRADIENT METHODS FOR EFFICIENT REINFORCEMENT LEARNING | Sep 25, 2019 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| Sample Efficient Policy Gradient Methods with Recursive Variance Reduction | Sep 18, 2019 | Policy Gradient Methods, reinforcement-learning | Code Available | 0 |
| DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning | Sep 18, 2019 | Deep Reinforcement Learning, Motion Planning | Unverified | 0 |
| Deep Reinforcement Learning Algorithm for Dynamic Pricing of Express Lanes with Multiple Access Locations | Sep 10, 2019 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Transfer Reward Learning for Policy Gradient-Based Text Generation | Sep 9, 2019 | Conditional Text Generation, Image Captioning | Unverified | 0 |
| Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles | Sep 7, 2019 | Policy Gradient Methods, Q-Learning | Unverified | 0 |
| Neural Policy Gradient Methods: Global Optimality and Rates of Convergence | Aug 29, 2019 | Policy Gradient Methods | Unverified | 0 |