| Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism | Oct 14, 2024 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning | Apr 30, 2023 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake | Feb 19, 2022 | Continual LearningSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey | Apr 22, 2024 | Lifelong learningreinforcement-learning | —Unverified | 0 | 0 |
| Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents | Apr 4, 2025 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning to Be Cautious | Oct 29, 2021 | counterfactualSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Learning to Recover for Safe Reinforcement Learning | Sep 21, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| LMPriors: Pre-Trained Language Models as Task-Specific Priors | Oct 22, 2022 | Causal InferenceCommon Sense Reasoning | —Unverified | 0 | 0 |
| Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving | Mar 27, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Long-term Safe Reinforcement Learning with Binary Feedback | Jan 8, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Lyapunov-based uncertainty-aware safe reinforcement learning | Jul 29, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Make Safe Decisions in Power System: Safe Reinforcement Learning Based Pre-decision Making for Voltage Stability Emergency Control | May 26, 2024 | Active LearningDecision Making | —Unverified | 0 | 0 |
| E^2CFD: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model | Jul 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Penalizing side effects using stepwise relative reachability | Jun 4, 2018 | Reinforcement LearningSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning | Aug 15, 2024 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles | Dec 18, 2021 | Collision Avoidancecontinuous-control | —Unverified | 0 | 0 |
| Modeling Risk in Reinforcement Learning: A Literature Mapping | Dec 8, 2023 | Managementreinforcement-learning | —Unverified | 0 | 0 |
| Uniformly Safe RL with Objective Suppression for Multi-Constraint Safety-Critical Applications | Feb 23, 2024 | Autonomous Drivingreinforcement-learning | —Unverified | 0 | 0 |
| Techno-Economic Modeling and Safe Operational Optimization of Multi-Network Constrained Integrated Community Energy Systems | Feb 8, 2024 | Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic | Feb 19, 2022 | Autonomous Drivingreinforcement-learning | —Unverified | 0 | 0 |
| Networked Multiagent Safe Reinforcement Learning for Low-carbon Demand Management in Distribution Network | Nov 27, 2023 | ManagementSafe Reinforcement Learning | —Unverified | 0 | 0 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process | Feb 25, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |