| Solving Rubik's Cube Without Tricky Sampling | Nov 29, 2024 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Solving Zero-Sum Convex Markov Games | Jun 19, 2025 | Policy Gradient Methods | —Unverified | 0 | 0 |
| SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin | Feb 19, 2025 | GPULogical Reasoning | —Unverified | 0 | 0 |
| Stabilizing Dynamical Systems via Policy Gradient Methods | Oct 13, 2021 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process | Mar 7, 2024 | Drug DesignPolicy Gradient Methods | —Unverified | 0 | 0 |
| StartNet: Online Detection of Action Start in Untrimmed Videos | Mar 23, 2019 | Action ClassificationPolicy Gradient Methods | —Unverified | 0 | 0 |
| Statistically Efficient Off-Policy Policy Gradients | Feb 10, 2020 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 | 0 |
| Stein Variational Policy Gradient | Apr 7, 2017 | Bayesian Inferencecontinuous-control | —Unverified | 0 | 0 |
| Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes | Jun 13, 2023 | Meta Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 | 0 |
| Stochastic Dimension-reduced Second-order Methods for Policy Optimization | Jan 28, 2023 | Policy Gradient MethodsSecond-order methods | —Unverified | 0 | 0 |