| Evolution Strategies as an Alternate Learning method for Hierarchical Reinforcement Learning | Sep 29, 2021 | Hierarchical Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 | 0 |
| CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization | Oct 1, 2018 | Abstractive Text SummarizationImage Captioning | —Unverified | 0 | 0 |
| BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings | Nov 30, 2024 | Bayesian OptimizationPolicy Gradient Methods | —Unverified | 0 | 0 |
| Adaptive Batch Size for Safe Policy Gradients | Dec 1, 2017 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 | 0 |
| Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator | Sep 17, 2020 | Imitation LearningOpenAI Gym | —Unverified | 0 | 0 |
| Evolutionary Policy Optimization | Apr 17, 2025 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 | 0 |
| Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes | Jun 6, 2024 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Beyond Exact Gradients: Convergence of Stochastic Soft-Max Policy Gradient Methods with Entropy Regularization | Oct 19, 2021 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Analysis of On-policy Policy Gradient Methods under the Distribution Mismatch | Mar 28, 2025 | Policy Gradient Methods | —Unverified | 0 | 0 |