| DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems | Jan 30, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 | 0 |
| Directed Policy Gradient for Safe Reinforcement Learning with Human Advice | Aug 13, 2018 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Distributionally Safe Reinforcement Learning under Model Uncertainty: A Single-Level Approach by Differentiable Convex Programming | Oct 3, 2023 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning | May 19, 2024 | counterfactualFriction | —Unverified | 0 | 0 |
| Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning | Dec 11, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning | May 22, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk | Dec 1, 2023 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments | Sep 29, 2022 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning | Mar 13, 2025 | Contrastive LearningRepresentation Learning | —Unverified | 0 | 0 |
| Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach | Nov 26, 2024 | Collision Avoidancereinforcement-learning | —Unverified | 0 | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RLSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Feasible Policy Iteration for Safe Reinforcement Learning | Apr 18, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Flipping-based Policy for Chance-Constrained Markov Decision Processes | Oct 9, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 | 0 |
| FOSP: Fine-tuning Offline Safe Policy through World Models | Jul 6, 2024 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning | Dec 12, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning | Nov 8, 2019 | Collision Avoidancereinforcement-learning | —Unverified | 0 | 0 |
| GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model | Jun 6, 2024 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Gradient Shaping for Multi-Constraint Safe Reinforcement Learning | Dec 23, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration | Sep 18, 2023 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents | Mar 11, 2025 | NavigateReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation | Mar 15, 2025 | Hierarchical Reinforcement LearningMotion Planning | —Unverified | 0 | 0 |
| Lagrangian-based online safe reinforcement learning for state-constrained systems | May 22, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning | May 4, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism | Oct 14, 2024 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning | Apr 30, 2023 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake | Feb 19, 2022 | Continual LearningSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey | Apr 22, 2024 | Lifelong learningreinforcement-learning | —Unverified | 0 | 0 |
| Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents | Apr 4, 2025 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning to Be Cautious | Oct 29, 2021 | counterfactualSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning to Recover for Safe Reinforcement Learning | Sep 21, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| LMPriors: Pre-Trained Language Models as Task-Specific Priors | Oct 22, 2022 | Causal InferenceCommon Sense Reasoning | —Unverified | 0 | 0 |
| Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving | Mar 27, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Long-term Safe Reinforcement Learning with Binary Feedback | Jan 8, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Lyapunov-based uncertainty-aware safe reinforcement learning | Jul 29, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Make Safe Decisions in Power System: Safe Reinforcement Learning Based Pre-decision Making for Voltage Stability Emergency Control | May 26, 2024 | Active LearningDecision Making | —Unverified | 0 | 0 |
| E^2CFD: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model | Jul 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Penalizing side effects using stepwise relative reachability | Jun 4, 2018 | Reinforcement LearningSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning | Aug 15, 2024 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles | Dec 18, 2021 | Collision Avoidancecontinuous-control | —Unverified | 0 | 0 |
| Modeling Risk in Reinforcement Learning: A Literature Mapping | Dec 8, 2023 | Managementreinforcement-learning | —Unverified | 0 | 0 |
| Uniformly Safe RL with Objective Suppression for Multi-Constraint Safety-Critical Applications | Feb 23, 2024 | Autonomous Drivingreinforcement-learning | —Unverified | 0 | 0 |
| Techno-Economic Modeling and Safe Operational Optimization of Multi-Network Constrained Integrated Community Energy Systems | Feb 8, 2024 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic | Feb 19, 2022 | Autonomous Drivingreinforcement-learning | —Unverified | 0 | 0 |
| Networked Multiagent Safe Reinforcement Learning for Low-carbon Demand Management in Distribution Network | Nov 27, 2023 | ManagementSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process | Feb 25, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| On Reward Structures of Markov Decision Processes | Aug 28, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions | Feb 10, 2021 | Anomaly DetectionSafe Reinforcement Learning | —Unverified | 0 | 0 |