SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving Oct 30, 2024 Autonomous Driving Imitation Learning
— Unverified 0Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL Aug 20, 2023 Atari Games continuous-control
— Unverified 0Soft Expert Reward Learning for Vision-and-Language Navigation Jul 21, 2020 Reinforcement Learning (RL) Vision and Language Navigation
— Unverified 0Regularized Softmax Deep Multi-Agent Q-Learning Mar 22, 2021 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning Sep 7, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Soft policy optimization using dual-track advantage estimator Sep 15, 2020 MuJoCo Reinforcement Learning (RL)
— Unverified 0Soft Q-Learning with Mutual-Information Regularization May 1, 2019 Decision Making Q-Learning
— Unverified 0Soft-Robust Actor-Critic Policy-Gradient Mar 11, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Soft-Robust Algorithms for Batch Reinforcement Learning Nov 30, 2020 Decision Making reinforcement-learning
— Unverified 0SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning Jan 11, 2023 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Solar Power driven EV Charging Optimization with Deep Reinforcement Learning Nov 17, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixels Oct 11, 2024 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Solipsistic Reinforcement Learning Mar 9, 2021 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience Sep 20, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0SOLO: Search Online, Learn Offline for Combinatorial Optimization Problems Apr 4, 2021 Combinatorial Optimization Decision Making
— Unverified 0Solver-Informed RL: Grounding Large Language Models for Authentic Optimization Modeling May 17, 2025 Decision Making reinforcement-learning
— Unverified 0Solve Traveling Salesman Problem by Monte Carlo Tree Search and Deep Neural Network May 14, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method Aug 20, 2017 3D Bin Packing Combinatorial Optimization
— Unverified 0Solving Bayesian inverse problems with diffusion priors and off-policy RL Mar 12, 2025 Reinforcement Learning (RL)
— Unverified 0Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties Jan 27, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Solving Continual Combinatorial Selection via Deep Reinforcement Learning Sep 9, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Solving Finite-Horizon MDPs via Low-Rank Tensors Jan 17, 2025 Reinforcement Learning (RL)
— Unverified 0Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning Mar 31, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Solving Math Word Problems with Double-Decoder Transformer Aug 28, 2019 Decoder Math
— Unverified 0Solving Multi-Goal Robotic Tasks with Decision Transformer Oct 8, 2024 Multi-Goal Reinforcement Learning reinforcement-learning
— Unverified 0Normalized Cut with Reinforcement Learning in Constrained Action Space May 20, 2025 Combinatorial Optimization reinforcement-learning
— Unverified 0Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning Nov 20, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Solving optimal stopping problems with Deep Q-Learning Jan 24, 2021 Q-Learning Reinforcement Learning (RL)
— Unverified 0Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients Oct 3, 2024 Reinforcement Learning (RL)
— Unverified 0Solving robust MDPs as a sequence of static RL problems Oct 8, 2024 Reinforcement Learning (RL)
— Unverified 0Solving Rubik's Cube Without Tricky Sampling Nov 29, 2024 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Solving single-objective tasks by preference multi-objective reinforcement learning Sep 25, 2019 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Solving Sokoban with forward-backward reinforcement learning May 5, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Solving Stochastic Games Dec 1, 2009 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Solving the capacitated vehicle routing problem with timing windows using rollouts and MAX-SAT Jun 14, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning Jun 16, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Solving the single-track train scheduling problem via Deep Reinforcement Learning Sep 1, 2020 Deep Reinforcement Learning Q-Learning
— Unverified 0Solving the Spike Feature Information Vanishing Problem in Spiking Deep Q Network with Potential Based Normalization Jun 8, 2022 image-classification Image Classification
— Unverified 0Solving the swing-up and balance task for the Acrobot and Pendubot with SAC Dec 18, 2023 Acrobot Position
— Unverified 0Solving the vehicle routing problem with deep reinforcement learning Jul 30, 2022 Combinatorial Optimization Deep Reinforcement Learning
— Unverified 0Some Supervision Required: Incorporating Oracle Policies in Reinforcement Learning via Epistemic Uncertainty Metrics Aug 22, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0SoNIC: Safe Social Navigation with Adaptive Conformal Inference and Constrained Reinforcement Learning Jul 24, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process Mar 13, 2025 Reinforcement Learning (RL)
— Unverified 0Source-Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language Aug 19, 2018 Cultural Vocal Bursts Intensity Prediction domain classification
— Unverified 0Source Critical Reinforcement Learning for Transferring Spoken Language Understanding to a New Language Aug 1, 2018 Cultural Vocal Bursts Intensity Prediction domain classification
— Unverified 0So You Think You Can Scale Up Autonomous Robot Data Collection? Nov 4, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Spacecraft Autonomous Decision-Planning for Collision Avoidance: a Reinforcement Learning Approach Oct 29, 2023 Collision Avoidance Decision Making
— Unverified 0Space Navigator: a Tool for the Optimization of Collision Avoidance Maneuvers Feb 6, 2019 Collision Avoidance reinforcement-learning
— Unverified 0Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies May 10, 2024 Reinforcement Learning (RL)
— Unverified 0Sparse Adversarial Attack in Multi-agent Reinforcement Learning May 19, 2022 Adversarial Attack Multi-agent Reinforcement Learning
— Unverified 0