| Enhancing System-Level Safety in Mixed-Autonomy Platoon via Safe Reinforcement Learning | Jan 20, 2024 | Autonomous DrivingCollision Avoidance | CodeCode Available | 1 |
| Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models | Jan 15, 2024 | FormReinforcement Learning (RL) | —Unverified | 0 |
| Safe reinforcement learning in uncertain contexts | Jan 11, 2024 | Multi-class Classificationreinforcement-learning | CodeCode Available | 0 |
| Long-term Safe Reinforcement Learning with Binary Feedback | Jan 8, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Gradient Shaping for Multi-Constraint Safe Reinforcement Learning | Dec 23, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration | Dec 22, 2023 | 4kreinforcement-learning | —Unverified | 0 |
| Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning | Dec 16, 2023 | Reinforcement Learning (RL)Safe Reinforcement Learning | CodeCode Available | 0 |
| Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations | Dec 13, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Modeling Risk in Reinforcement Learning: A Literature Mapping | Dec 8, 2023 | Managementreinforcement-learning | —Unverified | 0 |
| TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning | Dec 1, 2023 | reinforcement-learningSafe Reinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning in Tensor Reproducing Kernel Hilbert Space | Dec 1, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk | Dec 1, 2023 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning in a Simulated Robotic Arm | Nov 28, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Networked Multiagent Safe Reinforcement Learning for Low-carbon Demand Management in Distribution Network | Nov 27, 2023 | ManagementSafe Reinforcement Learning | —Unverified | 0 |
| Scheduling Distributed Flexible Assembly Lines using Safe Reinforcement Learning with Soft Shielding | Nov 21, 2023 | Safe Reinforcement LearningScheduling | —Unverified | 0 |
| State-Wise Safe Reinforcement Learning With Pixel Observations | Nov 3, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization | Nov 1, 2023 | Benchmarkingreinforcement-learning | —Unverified | 0 |
| Hierarchical Framework for Interpretable and Probabilistic Model-Based Safe Reinforcement Learning | Oct 28, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark | Oct 19, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Safe RLHF: Safe Reinforcement Learning from Human Feedback | Oct 19, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 3 |
| Specialized Deep Residual Policy Safe Reinforcement Learning-Based Controller for Complex and Continuous State-Action Spaces | Oct 15, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Robust Safe Reinforcement Learning under Adversarial Disturbances | Oct 11, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization | Oct 10, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Safe Deep Policy Adaptation | Oct 8, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning | Oct 5, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Distributionally Safe Reinforcement Learning under Model Uncertainty: A Single-Level Approach by Differentiable Convex Programming | Oct 3, 2023 | Safe Reinforcement Learning | —Unverified | 0 |
| Risk-Sensitive Inhibitory Control for Safe Reinforcement Learning | Oct 2, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Learning to Recover for Safe Reinforcement Learning | Sep 21, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration | Sep 18, 2023 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Safe Reinforcement Learning with Dual Robustness | Sep 13, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| On Reward Structures of Markov Decision Processes | Aug 28, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Towards Optimal Head-to-head Autonomous Racing with Curriculum Reinforcement Learning | Aug 25, 2023 | Autonomous RacingFriction | —Unverified | 0 |
| Safeguarding Learning-based Control for Smart Energy Systems with Sampling Specifications | Aug 11, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Approximate Model-Based Shielding for Safe Reinforcement Learning | Jul 27, 2023 | Atari Gamesmodel | CodeCode Available | 0 |
| SafeDreamer: Safe Reinforcement Learning with World Models | Jul 14, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Jul 13, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Safe Reinforcement Learning for Strategic Bidding of Virtual Power Plants in Day-Ahead Markets | Jul 11, 2023 | Safe Reinforcement Learning | —Unverified | 0 |
| Safety-Aware Task Composition for Discrete and Continuous Reinforcement Learning | Jun 29, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery | Jun 24, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Datasets and Benchmarks for Offline Safe Reinforcement Learning | Jun 15, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes | Jun 12, 2023 | Safe Reinforcement Learning | —Unverified | 0 |
| ROSARL: Reward-Only Safe Reinforcement Learning | May 31, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning | May 31, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Control invariant set enhanced safe reinforcement learning: improved sampling efficiency, guaranteed stability and robustness | May 24, 2023 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| GUARD: A Safe Reinforcement Learning Benchmark | May 23, 2023 | Autonomous DrivingDiversity | CodeCode Available | 1 |
| Lagrangian-based online safe reinforcement learning for state-constrained systems | May 22, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research | May 16, 2023 | Philosophyreinforcement-learning | CodeCode Available | 3 |
| Optimal Energy System Scheduling Using A Constraint-Aware Reinforcement Learning Algorithm | May 9, 2023 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning | Apr 30, 2023 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 |