| SOAC: The Soft Option Actor-Critic Architecture | Jun 25, 2020 | MuJoCoTransfer Learning | —Unverified | 0 | 0 |
| Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint | Mar 8, 2023 | MuJoCo | —Unverified | 0 | 0 |
| SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching | Jun 6, 2021 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Soft policy optimization using dual-track advantage estimator | Sep 15, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Solving Minimum-Cost Reach Avoid using Reinforcement Learning | Oct 29, 2024 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| SparseDice: Imitation Learning for Temporally Sparse Data via Regularization | Jun 13, 2021 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| SPP-RL: State Planning Policy Reinforcement Learning | Sep 29, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Multiagent Model-based Credit Assignment for Continuous Control | Dec 27, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Stochastic Variance Reduction for Policy Gradient Estimation | Oct 17, 2017 | continuous-controlContinuous Control | —Unverified | 0 | 0 |