| Reward Shaping Using Convolutional Neural Network | Oct 30, 2022 | MuJoCo | —Unverified | 0 | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Risk-Sensitive Generative Adversarial Imitation Learning | Aug 13, 2018 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Surfer: Progressive Reasoning with World Models for Robotic Manipulation | Jun 20, 2023 | Decision MakingMuJoCo | —Unverified | 0 | 0 |
| Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula | Nov 3, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification | Oct 20, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| On the Benefits of Inducing Local Lipschitzness for Robust Generative Adversarial Imitation Learning | Jun 30, 2021 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Robust Imitation of Diverse Behaviors | Jul 10, 2017 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Robust Model Based Reinforcement Learning Using L_1 Adaptive Control | Mar 21, 2024 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | Jun 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Robust Reinforcement Learning through Efficient Adversarial Herding | Jun 12, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | Jun 26, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Safe adaptation in multiagent competition | Mar 14, 2022 | MuJoCo | —Unverified | 0 | 0 |
| Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation | Jul 8, 2025 | MuJoCoOut-of-Distribution Detection | —Unverified | 0 | 0 |
| Safe Policy Learning for Continuous Control | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RLDiversity | —Unverified | 0 | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs | Jul 21, 2023 | MuJoCoRepresentation Learning | —Unverified | 0 | 0 |
| SEERL: Sample Efficient Ensemble Reinforcement Learning | Jan 15, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Self-Supervised Continuous Control without Policy Gradient | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning | Mar 3, 2020 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction | Sep 2, 2022 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| SEREN: Knowing When to Explore and When to Exploit | May 30, 2022 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning | Dec 5, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Simple Emergent Action Representations from Multi-Task Policy Training | Oct 18, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning | Aug 27, 2024 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity | Aug 14, 2019 | DecoderDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation | Dec 15, 2023 | Data AugmentationMuJoCo | —Unverified | 0 | 0 |
| Smooth Imitation Learning via Smooth Costs and Smooth Policies | Nov 3, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| SOAC: The Soft Option Actor-Critic Architecture | Jun 25, 2020 | MuJoCoTransfer Learning | —Unverified | 0 | 0 |
| Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint | Mar 8, 2023 | MuJoCo | —Unverified | 0 | 0 |
| SoftDICE for Imitation Learning: Rethinking Off-policy Distribution Matching | Jun 6, 2021 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Soft policy optimization using dual-track advantage estimator | Sep 15, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Solving Minimum-Cost Reach Avoid using Reinforcement Learning | Oct 29, 2024 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| SparseDice: Imitation Learning for Temporally Sparse Data via Regularization | Jun 13, 2021 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| SPP-RL: State Planning Policy Reinforcement Learning | Sep 29, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Multiagent Model-based Credit Assignment for Continuous Control | Dec 27, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Stochastic Variance Reduction for Policy Gradient Estimation | Oct 17, 2017 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees | Oct 4, 2022 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Supported Trust Region Optimization for Offline Reinforcement Learning | Nov 15, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network | May 26, 2025 | Evolutionary AlgorithmsMuJoCo | —Unverified | 0 | 0 |
| Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning | Mar 12, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Temporal Abstraction in Reinforcement Learning with Offline Data | Jul 21, 2024 | Hierarchical Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Temporal-adaptive Hierarchical Reinforcement Learning | Feb 6, 2020 | Atari GamesHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| MinMaxMin Q-learning | Feb 3, 2024 | MuJoCoQ-Learning | —Unverified | 0 | 0 |
| SQT -- std Q-target | Feb 3, 2024 | MuJoCoQ-Learning | —Unverified | 0 | 0 |
| Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision | Apr 21, 2025 | MuJoCoZero-shot Generalization | —Unverified | 0 | 0 |
| The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | Jun 16, 2025 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective | Aug 19, 2024 | MuJoCo | —Unverified | 0 | 0 |