| Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation | May 31, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 5 |
| Off-Policy Primal-Dual Safe Reinforcement Learning | Jan 26, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 5 |
| Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation | May 2, 2024 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 5 |
| Safe RLHF: Safe Reinforcement Learning from Human Feedback | Oct 19, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 3 |
| OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research | May 16, 2023 | Philosophyreinforcement-learning | CodeCode Available | 3 |
| Constrained Decision Transformer for Offline Safe Reinforcement Learning | Feb 14, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 2 |
| MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning | Sep 26, 2021 | BenchmarkingDecision Making | CodeCode Available | 2 |
| Datasets and Benchmarks for Offline Safe Reinforcement Learning | Jun 15, 2023 | Autonomous DrivingBenchmarking | CodeCode Available | 2 |
| A Review of Safe Reinforcement Learning: Methods, Theory and Applications | May 20, 2022 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning | Dec 14, 2021 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |