Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved) Jul 17, 2025 continuous-control Continuous Control
— Unverified 0Supervised Pretraining Can Learn In-Context Reinforcement Learning Jun 26, 2023 Decision Making In-Context Learning
— Unverified 0Supervised Reinforcement Learning with Recurrent Neural Network for Dynamic Treatment Recommendation Jul 4, 2018 Recommendation Systems reinforcement-learning
— Unverified 0SupervisorBot: NLP-Annotated Real-Time Recommendations of Psychotherapy Treatment Strategies with Deep Reinforcement Learning Aug 27, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas May 10, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0SURF: Semantic-level Unsupervised Reward Function for Machine Translation Jul 1, 2022 Diversity Machine Translation
— Unverified 0SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning Mar 18, 2022 Data Augmentation Reinforcement Learning (RL)
— Unverified 0Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning Mar 6, 2017 continuous-control Continuous Control
— Unverified 0SURREAL-System: Fully-Integrated Stack for Distributed Deep Reinforcement Learning Sep 27, 2019 CPU Deep Reinforcement Learning
— Unverified 0Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network May 26, 2025 Evolutionary Algorithms MuJoCo
— Unverified 0Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning Jul 22, 2019 Evolutionary Algorithms reinforcement-learning
— Unverified 0Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles Jan 30, 2020 Autonomous Driving Autonomous Vehicles
— Unverified 0Survey of Recent Multi-Agent Reinforcement Learning Algorithms Utilizing Centralized Training Jul 29, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Survey on Fair Reinforcement Learning: Theory and Practice May 20, 2022 Articles Decision Making
— Unverified 0Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods Mar 30, 2024 Autonomous Driving Language Modeling
— Unverified 0Survey on Multi-Agent Q-Learning frameworks for resource management in wireless sensor network May 5, 2021 Management Q-Learning
— Unverified 0Survey on reinforcement learning for language processing Apr 12, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach Feb 24, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Surveys without Questions: A Reinforcement Learning Approach Jun 11, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Survival Analysis on Structured Data using Deep Reinforcement Learning May 28, 2022 Deep Learning Deep Reinforcement Learning
— Unverified 0Survival Instinct in Offline Reinforcement Learning Jun 5, 2023 Offline RL reinforcement-learning
— Unverified 0Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts Oct 22, 2024 Reinforcement Learning (RL)
— Unverified 0SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning Mar 16, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0SVQN: Sequential Variational Soft Q-Learning Networks Jan 1, 2020 Decision Making Q-Learning
— Unverified 0Swarm Behavior Cloning Dec 10, 2024 Decision Making Imitation Learning
— Unverified 0Learning from Imperfect Demonstrations with Self-Supervision for Robotic Manipulation Jan 17, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Feb 25, 2025 Math Reinforcement Learning (RL)
— Unverified 0Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN Outspeeds Data Augmentation by Smaller Sample -- Application in Gomoku Reinforcement Learning Jan 11, 2023 Data Augmentation reinforcement-learning
— Unverified 0Switching Isotropic and Directional Exploration with Parameter Space Noise in Deep Reinforcement Learning Sep 18, 2018 Deep Reinforcement Learning OpenAI Gym
— Unverified 0Switching Linear Dynamics for Variational Bayes Filtering May 29, 2019 Bayesian Inference Model-based Reinforcement Learning
— Unverified 0Switching the Loss Reduces the Cost in Batch Reinforcement Learning Mar 8, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0SwitchMT: An Adaptive Context Switching Methodology for Scalable Multi-Task Learning in Intelligent Autonomous Agents Apr 18, 2025 Atari Games Multi-Task Learning
— Unverified 0Switch Trajectory Transformer with Distributional Value Approximation for Multi-Task Reinforcement Learning Mar 14, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Symbol-Based Over-the-Air Digital Predistortion Using Reinforcement Learning Nov 23, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Symbolic Explanation of Affinity-Based Reinforcement Learning Agents with Markov Models Aug 26, 2022 Management reinforcement-learning
— Unverified 0Symbolic Regression Methods for Reinforcement Learning Mar 22, 2019 Decision Making Friction
— Unverified 0Symbolic Reinforcement Learning for Safe RAN Control Mar 11, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Symmetry-aware Neural Architecture for Embodied Visual Navigation Dec 17, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Symmetry-Aware Neural Architecture for Embodied Visual Exploration Jan 1, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Symmetry Detection in Trajectory Data for More Meaningful Reinforcement Learning Representations Nov 29, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Symmetry Learning for Function Approximation in Reinforcement Learning Jun 9, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Symmetry reduction for deep reinforcement learning active control of chaotic spatiotemporal dynamics Apr 9, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Synchronous vs Asynchronous Reinforcement Learning in a Real World Robot Mar 17, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Synergistic Formulaic Alpha Generation for Quantitative Trading based on Reinforcement Learning Jan 5, 2024 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Synergizing AI and Digital Twins for Next-Generation Network Optimization, Forecasting, and Security Mar 8, 2025 Federated Learning Reinforcement Learning (RL)
— Unverified 0Synthesizing Chemical Plant Operation Procedures using Knowledge, Dynamic Simulation and Deep Reinforcement Learning Mar 6, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Synthesizing Programmatic Policies that Inductively Generalize May 1, 2020 Deep Reinforcement Learning Imitation Learning
— Unverified 0Synthesizing Safe Policies under Probabilistic Constraints with Reinforcement Learning and Bayesian Model Checking May 8, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Synthesizing world models for bilevel planning Mar 26, 2025 Large Language Model Program Synthesis
— Unverified 0Synthetic Acute Hypotension and Sepsis Datasets Based on MIMIC-III and Published as Part of the Health Gym Project Dec 7, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0