| Route Optimization via Environment-Aware Deep Network and Reinforcement Learning | Nov 16, 2021 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics | Apr 20, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Oct 16, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Safe Policy Improvement by Minimizing Robust Baseline Regret | Jul 13, 2016 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Safe POMDP Online Planning via Shielding | Sep 19, 2023 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Safe Posterior Sampling for Constrained MDPs with Bounded Constraint Violation | Jan 27, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Safe Sequential Optimization for Switching Environments | Nov 3, 2023 | Bayesian OptimizationChange Point Detection | —Unverified | 0 | 0 |
| Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Sep 26, 2024 | Bayesian OptimizationChange Detection | —Unverified | 0 | 0 |
| Safety-Aware Algorithms for Adversarial Contextual Bandit | Aug 1, 2017 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Sample-Efficient Behavior Cloning Using General Domain Knowledge | Jan 27, 2025 | Car RacingFeature Engineering | —Unverified | 0 | 0 |
| Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions | Jul 29, 2022 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Sampling Through the Lens of Sequential Decision Making | Aug 17, 2022 | Decision MakingInformation Retrieval | —Unverified | 0 | 0 |
| SAPO-RL: Sequential Actuator Placement Optimization for Fuselage Assembly via Reinforcement Learning | Apr 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning | May 29, 2019 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks | Apr 29, 2024 | Bayesian InferenceGaussian Processes | —Unverified | 0 | 0 |
| Scalable First-Order Methods for Robust MDPs | May 11, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Scalable Thompson Sampling via Optimal Transport | Feb 19, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Scaling Multi-Armed Bandit Algorithms | Jul 25, 2019 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Scaling up ML-based Black-box Planning with Partial STRIPS Models | Jul 10, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Second-order Quantile Methods for Experts and Combinatorial Games | Feb 27, 2015 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors | Jul 21, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| Selective Reviews of Bandit Problems in AI via a Statistical View | Dec 3, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Self-Evaluation for Job-Shop Scheduling | Feb 12, 2025 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| Self-evolving Autoencoder Embedded Q-Network | Feb 18, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks | May 1, 2025 | Decision MakingLarge Language Model | —Unverified | 0 | 0 |
| Self-Supervised Reinforcement Learning that Transfers using Random Features | May 26, 2023 | Decision MakingModel Predictive Control | —Unverified | 0 | 0 |
| Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | Mar 1, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning | Feb 22, 2021 | Decision MakingDistributional Reinforcement Learning | —Unverified | 0 | 0 |
| SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning | Jan 1, 2025 | Representation LearningSequential Decision Making | —Unverified | 0 | 0 |
| Sequential Batch Learning in Finite-Action Linear Contextual Bandits | Apr 14, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Sequential Bayesian experimental designs via reinforcement learning | Feb 14, 2022 | Bayesian InferenceDecision Making | —Unverified | 0 | 0 |
| Sequential Decision-Making for Inline Text Autocomplete | Mar 21, 2024 | Decision MakingLanguage Modelling | —Unverified | 0 | 0 |
| Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings | Oct 25, 2022 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| Sequential Fair Resource Allocation under a Markov Decision Process Framework | Jan 10, 2023 | Decision MakingFairness | —Unverified | 0 | 0 |
| Sequential Information Design: Learning to Persuade in the Dark | Sep 8, 2022 | Decision MakingPersuasiveness | —Unverified | 0 | 0 |
| Sequential Stochastic Optimization in Separable Learning Environments | Aug 21, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Sequential Treatment Effect Estimation with Unmeasured Confounders | May 14, 2025 | counterfactualSequential Decision Making | —Unverified | 0 | 0 |
| Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making | Oct 31, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Shaping Laser Pulses with Reinforcement Learning | Mar 1, 2025 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Sharp Thresholds of the Information Cascade Fragility Under a Mismatched Model | Jun 7, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Short-Long Policy Evaluation with Novel Actions | Jul 4, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Similarities between policy gradient methods (PGM) in Reinforcement learning (RL) and supervised learning (SL) | Apr 12, 2019 | Decision MakingPolicy Gradient Methods | —Unverified | 0 | 0 |
| Simulating Network Paths with Recurrent Buffering Units | Feb 23, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Single and Multi-Agent Deep Reinforcement Learning for AI-Enabled Wireless Networks: A Tutorial | Nov 6, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Situated Language Learning via Interactive Narratives | Mar 18, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Sliding-Window Thompson Sampling for Non-Stationary Settings | Sep 8, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Jan 24, 2023 | Decision MakingImitation Learning | —Unverified | 0 | 0 |
| Socially-Optimal Mechanism Design for Incentivized Online Learning | Dec 29, 2021 | Decision MakingEdge-computing | —Unverified | 0 | 0 |