| Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem | Aug 21, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft | Dec 1, 2024 | Decision MakingMinecraft | —Unverified | 0 | 0 |
| Stochastic Contextual Bandits with Known Reward Functions | Apr 30, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning | Jan 19, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Stochastic Planning and Lifted Inference | Jan 4, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | AnatomyDecision Making | —Unverified | 0 | 0 |
| Streaming Adaptive Submodular Maximization | Aug 17, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | Feb 28, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Structure and Reduction of MCTS for Explainable-AI | Aug 10, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |