| SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning | May 24, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| SHIRE: Enhancing Sample Efficiency using Human Intuition in REinforcement Learning | Sep 16, 2024 | Deep Reinforcement LearningOptical Flow Estimation | —Unverified | 0 | 0 |
| Should artificial agents ask for help in human-robot collaborative problem-solving? | May 25, 2020 | Q-Learning | —Unverified | 0 | 0 |
| Show Us the Way: Learning to Manage Dialog from Demonstrations | Apr 17, 2020 | dialog state trackingManagement | —Unverified | 0 | 0 |
| Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent States | Feb 10, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization | Feb 25, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Simultaneously Updating All Persistence Values in Reinforcement Learning | Nov 21, 2022 | AllAtari Games | —Unverified | 0 | 0 |
| Single-Agent vs. Multi-Agent Techniques for Concurrent Reinforcement Learning of Negotiation Dialogue Policies | Jun 1, 2014 | Dialogue ManagementMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Data-Incremental Continual Offline Reinforcement Learning | Apr 19, 2024 | Continual LearningOffline RL | —Unverified | 0 | 0 |
| Single-Trajectory Distributionally Robust Reinforcement Learning | Jan 27, 2023 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| SlateFree: a Model-Free Decomposition for Reinforcement Learning with Slate Actions | Sep 5, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Smart Home Energy Management: Sequence-to-Sequence Load Forecasting and Q-Learning | Sep 25, 2021 | energy managementLoad Forecasting | —Unverified | 0 | 0 |
| Smart Home Energy Management: VAE-GAN synthetic dataset generator and Q-learning | May 14, 2023 | energy managementGenerative Adversarial Network | —Unverified | 0 | 0 |
| Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning | May 14, 2024 | Q-Learning | —Unverified | 0 | 0 |
| SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition | Mar 4, 2024 | Hierarchical Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Smoothed Action Value Functions for Learning Gaussian Policies | Mar 6, 2018 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Smoothed Q-learning | Mar 15, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity | Jun 2, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Mar 22, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Soft Q Network | Dec 20, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Software-Level Accuracy Using Stochastic Computing With Charge-Trap-Flash Based Weight Matrix | Mar 9, 2020 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems | Apr 4, 2021 | Combinatorial OptimizationDecision Making | —Unverified | 0 | 0 |
| A Generalized Minimax Q-learning Algorithm for Two-Player Zero-Sum Stochastic Games | Jun 16, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity | Aug 29, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Solving optimal stopping problems with Deep Q-Learning | Jan 24, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Solving the Model Unavailable MARE using Q-Learning Algorithm | Jul 18, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Solving the single-track train scheduling problem via Deep Reinforcement Learning | Sep 1, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization | May 18, 2021 | Atari GamesAutonomous Driving | —Unverified | 0 | 0 |
| Toward Packet Routing with Fully-distributed Multi-agent Deep Reinforcement Learning | May 9, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Speedy Q-Learning | Dec 1, 2011 | Q-Learning | —Unverified | 0 | 0 |
| SPEQ: Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning | Jan 15, 2025 | Computational Efficiencycontinuous-control | —Unverified | 0 | 0 |
| Split Deep Q-Learning for Robust Object Singulation | Sep 17, 2019 | feature selectionObject | —Unverified | 0 | 0 |
| Algorithmic collusion under competitive design | Dec 5, 2023 | Q-Learning | —Unverified | 0 | 0 |
| SQLR: Short-Term Memory Q-Learning for Elastic Provisioning | Sep 12, 2019 | BlockingQ-Learning | —Unverified | 0 | 0 |
| Stability of Multi-Agent Learning: Convergence in Network Games with Many Players | Jul 26, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Stability of Multi-Agent Learning in Competitive Networks: Delaying the Onset of Chaos | Dec 19, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Stability of Q-Learning Through Design and Optimism | Jul 5, 2023 | Q-Learning | —Unverified | 0 | 0 |
| Stabilizing Q Learning Via Soft Mellowmax Operator | Dec 17, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning | Jun 1, 2022 | Q-Learning | —Unverified | 0 | 0 |
| Stabilizing Transformer-Based Action Sequence Generation For Q-Learning | Oct 23, 2020 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| State-Augmentation Transformations for Risk-Sensitive Reinforcement Learning | Apr 16, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| State Distribution-aware Sampling for Deep Q-learning | Apr 23, 2018 | Atari GamesOpenAI Gym | —Unverified | 0 | 0 |
| State Estimation Using Particle Filtering in Adaptive Machine Learning Methods: Integrating Q-Learning and NEAT Algorithms with Noisy Radar Measurements | Apr 10, 2025 | Q-LearningState Estimation | —Unverified | 0 | 0 |
| State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards | Mar 18, 2024 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning | Apr 1, 2020 | NegationQ-Learning | —Unverified | 0 | 0 |
| STMARL: A Spatio-Temporal Multi-Agent Reinforcement Learning Approach for Cooperative Traffic Light Control | Aug 28, 2019 | Graph Neural NetworkManagement | —Unverified | 0 | 0 |
| Stochastic Approximation for Risk-aware Markov Decision Processes | May 11, 2018 | Q-Learning | —Unverified | 0 | 0 |
| Stochastic Approximation with Delayed Updates: Finite-Time Rates under Markovian Sampling | Feb 19, 2024 | AvgMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |