| Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation | Jun 9, 2023 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Entropy annealing for policy mirror descent in continuous time and space | May 30, 2024 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Entropic Risk Measure in Policy Search | Jun 21, 2019 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | May 29, 2025 | Denoising, Imitation Learning | —Unverified | 0 | 0 |
| End-to-End Neuro-Symbolic Architecture for Image-to-Image Reasoning Tasks | Jun 6, 2021 | Image Reconstruction, Policy Gradient Methods | —Unverified | 0 | 0 |
| Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient | Oct 27, 2020 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Almost sure convergence rates of stochastic gradient methods under gradient domination | May 22, 2024 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Elementary Analysis of Policy Gradient Methods | Apr 4, 2024 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Batch Policy Gradient Methods for Improving Neural Conversation Models | Feb 10, 2017 | Chatbot, Policy Gradient Methods | —Unverified | 0 | 0 |
| Efficient Wasserstein and Sinkhorn Policy Optimization | Sep 29, 2021 | Policy Gradient Methods, Reinforcement Learning (RL) | —Unverified | 0 | 0 |