| S2VG: Soft Stochastic Value Gradient method | Sep 25, 2019 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Model Imitation for Model-Based Reinforcement Learning | Sep 25, 2019 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Policy Optimization In the Face of Uncertainty | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving Exploration of Deep Reinforcement Learning using Planning for Policy Search | Sep 25, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Counterfactual Regularization for Model-Based Reinforcement Learning | Sep 25, 2019 | counterfactualmodel | —Unverified | 0 |
| Learning by shaking: Computing policy gradients by physical forward-propagation | Sep 25, 2019 | Model-based Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Revisit Policy Optimization in Matrix Form | Sep 19, 2019 | FormModel-based Reinforcement Learning | —Unverified | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 |