| Route Optimization via Environment-Aware Deep Network and Reinforcement Learning | Nov 16, 2021 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics | Apr 20, 2022 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Oct 16, 2024 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Safe Policy Improvement by Minimizing Robust Baseline Regret | Jul 13, 2016 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| Safe POMDP Online Planning via Shielding | Sep 19, 2023 | Autonomous Driving, Decision Making | —Unverified | 0 | 0 |
| Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation | Jan 27, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Safe Sequential Optimization for Switching Environments | Nov 3, 2023 | Bayesian Optimization, Change Point Detection | —Unverified | 0 | 0 |
| Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Sep 26, 2024 | Bayesian Optimization, Change Detection | —Unverified | 0 | 0 |
| Safety-Aware Algorithms for Adversarial Contextual Bandit | Aug 1, 2017 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision Making, Imitation Learning | —Unverified | 0 | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | Jan 27, 2025 | Car Racing, Feature Engineering | —Unverified | 0 | 0 |
| Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | Jul 29, 2022 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Sampling Through the Lens of Sequential Decision Making | Aug 17, 2022 | Decision Making, Information Retrieval | —Unverified | 0 | 0 |
| SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | Apr 24, 2025 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning | May 29, 2019 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Apr 29, 2024 | Bayesian Inference, Gaussian Processes | —Unverified | 0 | 0 |
| Scalable First-Order Methods for Robust MDPs | May 11, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Scalable Thompson Sampling via Optimal Transport | Feb 19, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Scaling Multi-Armed Bandit Algorithms | Jul 25, 2019 | Multi-Armed Bandits, Sequential Decision Making | —Unverified | 0 | 0 |
| Scaling up ML-based Black-box Planning with Partial STRIPS Models | Jul 10, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Second-order Quantile Methods for Experts and Combinatorial Games | Feb 27, 2015 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors | Jul 21, 2023 | Decision Making, Language Modeling | —Unverified | 0 | 0 |
| Selective Reviews of Bandit Problems in AI via a Statistical View | Dec 3, 2024 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| Self-Evaluation for Job-Shop Scheduling | Feb 12, 2025 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |