| Directed Policy Gradient for Safe Reinforcement Learning with Human Advice | Aug 13, 2018 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Distributionally Safe Reinforcement Learning under Model Uncertainty: A Single-Level Approach by Differentiable Convex Programming | Oct 3, 2023 | Safe Reinforcement Learning | —Unverified | 0 |
| Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning | May 19, 2024 | counterfactualFriction | —Unverified | 0 |
| Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning | Dec 11, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning | May 22, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk | Dec 1, 2023 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments | Sep 29, 2022 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning | Mar 13, 2025 | Contrastive LearningRepresentation Learning | —Unverified | 0 |
| Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach | Nov 26, 2024 | Collision Avoidancereinforcement-learning | —Unverified | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RLSafe Reinforcement Learning | —Unverified | 0 |