| Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models | Jul 16, 2023 | Policy Gradient Methods | Code Available | 0 |
| Deep Reinforcement Learning for Dialogue Generation | Jun 5, 2016 | Chatbot, Deep Reinforcement Learning | Code Available | 0 |
| Sample Efficient Policy Gradient Methods with Recursive Variance Reduction | Sep 18, 2019 | Policy Gradient Methods, reinforcement-learning | Code Available | 0 |
| Fast Efficient Hyperparameter Tuning for Policy Gradients | Feb 18, 2019 | Meta-Learning, Policy Gradient Methods | Code Available | 0 |
| Action-depedent Control Variates for Policy Optimization via Stein's Identity | Oct 30, 2017 | Policy Gradient Methods, reinforcement-learning | Code Available | 0 |
| Remember and Forget for Experience Replay | Jul 16, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control | Aug 10, 2017 | continuous-control, Continuous Control | Code Available | 0 |
| Where Did My Optimum Go?: An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods | Oct 5, 2018 | continuous-control, Continuous Control | Code Available | 0 |
| Shapley Q-value: A Local Reward Approach to Solve Global Reward Games | Jul 11, 2019 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models | Feb 23, 2019 | Decision Making, Dialogue Generation | Code Available | 0 |