Steering Your Diffusion Policy with Latent Space Reinforcement Learning Jun 18, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning Jun 14, 2022 Multi-Goal Reinforcement Learning reinforcement-learning
— Unverified 0Stein Variational Policy Gradient Apr 7, 2017 Bayesian Inference continuous-control
— Unverified 0Stepping Out of the Shadows: Reinforcement Learning in Shadow Mode Oct 30, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs May 19, 2025 Mathematical Reasoning Reinforcement Learning (RL)
— Unverified 0Stigmergic Independent Reinforcement Learning for Multi-Agent Collaboration Nov 28, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Stochastically Dominant Distributional Reinforcement Learning May 17, 2019 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Stochastic Approximation of Gaussian Free Energy for Risk-Sensitive Reinforcement Learning May 21, 2021 Decision Making reinforcement-learning
— Unverified 0Stochastic Approximation with Markov Noise: Analysis and applications in reinforcement learning Apr 8, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Stochastic Constraint Programming as Reinforcement Learning Apr 24, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Stochastic convex optimization for provably efficient apprenticeship learning Dec 31, 2021 Imitation Learning reinforcement-learning
— Unverified 0Stochastic evolution in populations of ideas Sep 14, 2016 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Stochastic Gradient Descent with Dependent Data for Offline Reinforcement Learning Feb 6, 2022 Q-Learning reinforcement-learning
— Unverified 0Black-box Optimizer with Implicit Natural Gradient Oct 9, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Stochastic Intervention for Causal Inference via Reinforcement Learning May 28, 2021 Causal Inference Decision Making
— Unverified 0Stochastic Inverse Reinforcement Learning May 21, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Stochastic Inverse Reinforcement Learning Oct 23, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Stochastic Learning Approach to Binary Optimization for Optimal Design of Experiments Jan 15, 2021 Experimental Design Reinforcement Learning (RL)
— Unverified 0Stochastic Lipschitz Q-Learning Apr 24, 2019 Q-Learning Reinforcement Learning
— Unverified 0Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning Dec 8, 2016 reinforcement-learning Reinforcement Learning
— Unverified 0Stochastic Q-learning for Large Discrete Action Spaces May 16, 2024 Decision Making Q-Learning
— Unverified 0Stochastic Reinforcement Learning Feb 11, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Function May 25, 2022 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Stochastic Variance Reduction for Deep Q-learning May 20, 2019 Deep Reinforcement Learning Q-Learning
— Unverified 0Stochastic Variance Reduction for Policy Gradient Estimation Oct 17, 2017 continuous-control Continuous Control
— Unverified 0Stochastic Variance Reduction Methods for Policy Evaluation Feb 25, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Stock market microstructure inference via multi-agent reinforcement learning Sep 17, 2019 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Stock Trading Optimization through Model-based Reinforcement Learning with Resistance Support Relative Strength May 30, 2022 Decision Making Model-based Reinforcement Learning
— Unverified 0Model Based Reinforcement Learning with Non-Gaussian Environment Dynamics and its Application to Portfolio Optimization Jan 23, 2023 Algorithmic Trading Decision Making
— Unverified 0Stop Regressing: Training Value Functions via Classification for Scalable Deep RL Mar 6, 2024 Atari Games Deep Reinforcement Learning
— Unverified 0Storage Efficient and Dynamic Flexible Runtime Channel Pruning via Deep Reinforcement Learning Dec 1, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Story Shaping: Teaching Agents Human-like Behavior with Stories Jan 24, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Straight to the point: reinforcement learning for user guidance in ultrasound Mar 2, 2019 Anatomy Diagnostic
— Unverified 0Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning May 22, 2025 Reinforcement Learning (RL)
— Unverified 0Strategically-timed State-Observation Attacks on Deep Reinforcement Learning Agents Jun 18, 2021 Adversarial Attack continuous-control
— Unverified 0Strategic bidding in freight transport using deep reinforcement learning Feb 18, 2021 Deep Reinforcement Learning Fairness
— Unverified 0Strategic Maneuver and Disruption with Reinforcement Learning Approaches for Multi-Agent Coordination Mar 17, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Optimizing Trading Strategies in Quantitative Markets using Multi-Agent Reinforcement Learning Mar 15, 2023 Decision Making Multi-agent Reinforcement Learning
— Unverified 0Strategies for Using Proximal Policy Optimization in Mobile Puzzle Games Jul 3, 2020 Reinforcement Learning (RL)
— Unverified 0Strategising template-guided needle placement for MR-targeted prostate biopsy Jul 21, 2022 Anatomy Decision Making
— Unverified 0Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks Sep 30, 2020 Atari Games Deep Reinforcement Learning
— Unverified 0Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning Feb 22, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Stratified Expert Cloning with Adaptive Selection for User Retention in Large-Scale Recommender Systems Apr 8, 2025 Imitation Learning Recommendation Systems
— Unverified 0Stratospheric Aerosol Injection as a Deep Reinforcement Learning Problem May 17, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Streaming Linear System Identification with Reverse Experience Replay Mar 10, 2021 Reinforcement Learning (RL) Time Series Analysis
— Unverified 0Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning Dec 24, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation Apr 22, 2025 Reinforcement Learning (RL) Scheduling
— Unverified 0Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning Jun 26, 2025 Decision Making Hierarchical Reinforcement Learning
— Unverified 0S-TRIGGER: Continual State Representation Learning via Self-Triggered Generative Replay Feb 25, 2019 Change Detection Continual Learning
— Unverified 0Striving for Simplicity in Off-Policy Deep Reinforcement Learning Sep 25, 2019 Atari Games Deep Reinforcement Learning
— Unverified 0