| STDPG: A Spatio-Temporal Deterministic Policy Gradient Agent for Dynamic Routing in SDN | Apr 21, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning | May 14, 2020 | Adversarial Attack, Deep Reinforcement Learning | —Unverified | 0 |
| Stealthy Imitation: Reward-guided Environment-free Policy Stealing | May 11, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Stochastic Dynamic Network Utility Maximization with Application to Disaster Response | Jun 6, 2024 | Deep Reinforcement Learning, Disaster Response | —Unverified | 0 |
| Stochastic Variance Reduction for Deep Q-learning | May 20, 2019 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Stone Soup Multi-Target Tracking Feature Extraction For Autonomous Search And Track In Deep Reinforcement Learning Environment | Mar 3, 2025 | Deep Reinforcement Learning, Management | —Unverified | 0 |
| Stop Regressing: Training Value Functions via Classification for Scalable Deep RL | Mar 6, 2024 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 |
| Storage Efficient and Dynamic Flexible Runtime Channel Pruning via Deep Reinforcement Learning | Dec 1, 2020 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Strategically-timed State-Observation Attacks on Deep Reinforcement Learning Agents | Jun 18, 2021 | Adversarial Attack, continuous-control | —Unverified | 0 |