| On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning | Feb 12, 2020 | Meta-LearningMeta Reinforcement Learning | CodeCode Available | 0 | 5 |
| Commodities Trading through Deep Policy Gradient Methods | Aug 10, 2023 | Algorithmic TradingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| An Off-policy Policy Gradient Theorem Using Emphatic Weightings | Nov 22, 2018 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 | 0 |
| An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods | Nov 15, 2022 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Momentum-Based Policy Gradient with Second-Order Information | May 17, 2022 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs | Feb 20, 2021 | Policy Gradient Methods | —Unverified | 0 | 0 |
| CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization | Oct 1, 2018 | Abstractive Text SummarizationImage Captioning | —Unverified | 0 | 0 |
| BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings | Nov 30, 2024 | Bayesian OptimizationPolicy Gradient Methods | —Unverified | 0 | 0 |
| Adaptive Batch Size for Safe Policy Gradients | Dec 1, 2017 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 | 0 |
| Evolutionary Policy Optimization | Apr 17, 2025 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 | 0 |