| ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy | Feb 8, 2025 | Q-Learning, Safe Exploration | Code Available | 3 | 5 |
| MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning | Sep 26, 2021 | Benchmarking, Decision Making | Code Available | 2 | 5 |
| Provably Safe PAC-MDP Exploration Using Analogies | Jul 7, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1 | 5 |
| Neurosymbolic Reinforcement Learning with Formally Verified Exploration | Sep 26, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1 | 5 |
| State-Wise Safe Reinforcement Learning With Pixel Observations | Nov 3, 2023 | reinforcement-learning, Reinforcement Learning | Code Available | 1 | 5 |
| SafeML: Safety Monitoring of Machine Learning Classifiers through Statistical Difference Measure | May 27, 2020 | BIG-bench Machine Learning, Domain Adaptation | Code Available | 1 | 5 |
| Toward Safe and Accelerated Deep Reinforcement Learning for Next-Generation Wireless Networks | Sep 16, 2022 | Deep Reinforcement Learning, Management | Code Available | 1 | 5 |
| Safe Exploration in Continuous Action Spaces | Jan 26, 2018 | Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 | 5 |
| Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety | May 22, 2021 | reinforcement-learning, Reinforcement Learning | Code Available | 1 | 5 |
| Transductive Active Learning with Application to Safe Bayesian Optimization | Jul 12, 2024 | Active Learning, Bayesian Optimization | Code Available | 1 | 5 |
| Verifiably Safe Exploration for End-to-End Reinforcement Learning | Jul 2, 2020 | Deep Reinforcement Learning, object-detection | Code Available | 1 | 5 |
| Autonomous UAV Exploration of Dynamic Environments via Incremental Sampling and Probabilistic Roadmap | Oct 14, 2020 | Safe Exploration | Code Available | 1 | 5 |
| Near-Optimal Multi-Agent Learning for Safe Coverage Control | Oct 12, 2022 | Diversity, Navigate | Code Available | 1 | 5 |
| Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm | Oct 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Sep 29, 2020 | General Reinforcement Learning, Minecraft | Code Available | 1 | 5 |
| AI Safety Gridworlds | Nov 27, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Learning-based Model Predictive Control for Safe Exploration | Mar 22, 2018 | model, Model Predictive Control | Code Available | 0 | 5 |
| Atlas: Automate Online Service Configuration in Network Slicing | Oct 30, 2022 | Bayesian Optimization, Safe Exploration | Code Available | 0 | 5 |
| Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning | Jun 27, 2019 | Model Predictive Control, reinforcement-learning | Code Available | 0 | 5 |
| Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving | Jun 4, 2025 | Autonomous Driving, Imitation Learning | Code Available | 0 | 5 |
| Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning | Sep 18, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0 | 5 |
| Concrete Problems in AI Safety | Jun 21, 2016 | BIG-bench Machine Learning, Safe Exploration | Code Available | 0 | 5 |
| Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints | Jul 22, 2024 | Safe Exploration | Code Available | 0 | 5 |
| Infinite Time Horizon Safety of Bayesian Neural Networks | Nov 4, 2021 | reinforcement-learning, Reinforcement Learning | Code Available | 0 | 5 |
| Curiosity Killed or Incapacitated the Cat and the Asymptotically Optimal Agent | Jun 5, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 0 | 5 |