| Momentum-Based Policy Gradient with Second-Order Information | May 17, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control | Mar 7, 2024 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Fine-Grained AutoAugmentation for Multi-Label Classification | Jul 12, 2021 | ClassificationData Augmentation | —Unverified | 0 |
| Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems | Nov 1, 2022 | Policy Gradient Methods | —Unverified | 0 |
| Fingerprint Policy Optimisation for Robust Reinforcement Learning | May 27, 2018 | Bayesian OptimisationContinuous Control | —Unverified | 0 |
| Focused Hierarchical RNNs for Conditional Sequence Processing | Jun 12, 2018 | Open-Domain Question AnsweringPolicy Gradient Methods | —Unverified | 0 |
| A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee | Feb 11, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Ad Headline Generation using Self-Critical Masked Language Model | Jun 1, 2021 | Headline GenerationLanguage Modeling | —Unverified | 0 |
| Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game | Sep 29, 2021 | counterfactualDeep Reinforcement Learning | —Unverified | 0 |
| Geometry and convergence of natural policy gradient methods | Nov 3, 2022 | Policy Gradient Methods | —Unverified | 0 |