| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | Dec 13, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version) | Nov 29, 2015 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces | Aug 25, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| SS-MAIL: Self-Supervised Multi-Agent Imitation Learning | Oct 18, 2021 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds | May 26, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Stagewise Safe Bayesian Optimization with Gaussian Processes | Jun 20, 2018 | Bayesian Optimization, Decision Making | —Unverified | 0 | 0 |
| State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding | Sep 21, 2023 | Decision Making, Self-Learning | —Unverified | 0 | 0 |
| State of the Art of User Simulation approaches for conversational information retrieval | Jan 10, 2022 | Decision Making, Information Retrieval | —Unverified | 0 | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem | Aug 21, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| STEVE-Audio: Expanding the Goal Conditioning Modalities of Embodied Agents in Minecraft | Dec 1, 2024 | Decision Making, Minecraft | —Unverified | 0 | 0 |
| Stochastic Contextual Bandits with Known Reward Functions | Apr 30, 2016 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning | Jan 19, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Stochastic Planning and Lifted Inference | Jan 4, 2017 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Strategising template-guided needle placement for MR-targeted prostate biopsy | Jul 21, 2022 | Anatomy, Decision Making | —Unverified | 0 | 0 |
| Streaming Adaptive Submodular Maximization | Aug 17, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | Feb 28, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Structure and Reduction of MCTS for Explainable-AI | Aug 10, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Structure Learning in Human Sequential Decision-Making | Dec 1, 2008 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Subgoal-Based Explanations for Unreliable Intelligent Decision Support Systems | Jan 11, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Subgoal Discovery Using a Free Energy Paradigm and State Aggregations | Dec 21, 2024 | Reinforcement Learning (RL), Sequential Decision Making | —Unverified | 0 | 0 |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Feb 26, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Supervised Fine-Tuning as Inverse Reinforcement Learning | Mar 18, 2024 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |