| Deep Reinforcement Learning Algorithm for Dynamic Pricing of Express Lanes with Multiple Access Locations | Sep 10, 2019 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity and Last-Iterate Convergence | Sep 8, 2023 | Multi-agent Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Fast Efficient Hyperparameter Tuning for Policy Gradients | Feb 18, 2019 | Meta-LearningPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 | 5 |
| MDPGT: Momentum-based Decentralized Policy Gradient Tracking | Dec 6, 2021 | Multi-agent Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Evaluating Rewards for Question Generation Models | Feb 28, 2019 | Machine TranslationPolicy Gradient Methods | CodeCode Available | 0 | 5 |
| Fast Efficient Hyperparameter Tuning for Policy Gradient Methods | Dec 1, 2019 | Policy Gradient Methods | CodeCode Available | 0 | 5 |
| Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning | Oct 18, 2023 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 0 | 5 |
| Dual Learning for Machine Translation | Nov 1, 2016 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| A Nonparametric Off-Policy Policy Gradient | Jan 8, 2020 | Density EstimationPolicy Gradient Methods | CodeCode Available | 0 | 5 |