| ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy | Feb 8, 2025 | Q-LearningSafe Exploration | CodeCode Available | 3 |
| MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning | Sep 26, 2021 | BenchmarkingDecision Making | CodeCode Available | 2 |
| SafeML: Safety Monitoring of Machine Learning Classifiers through Statistical Difference Measure | May 27, 2020 | BIG-bench Machine LearningDomain Adaptation | CodeCode Available | 1 |
| Toward Safe and Accelerated Deep Reinforcement Learning for Next-Generation Wireless Networks | Sep 16, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Provably Safe PAC-MDP Exploration Using Analogies | Jul 7, 2020 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Verifiably Safe Exploration for End-to-End Reinforcement Learning | Jul 2, 2020 | Deep Reinforcement Learningobject-detection | CodeCode Available | 1 |
| Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety | May 22, 2021 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Neurosymbolic Reinforcement Learning with Formally Verified Exploration | Sep 26, 2020 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Transductive Active Learning with Application to Safe Bayesian Optimization | Jul 12, 2024 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| Near-Optimal Multi-Agent Learning for Safe Coverage Control | Oct 12, 2022 | DiversityNavigate | CodeCode Available | 1 |
| State-Wise Safe Reinforcement Learning With Pixel Observations | Nov 3, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Safe Exploration in Continuous Action Spaces | Jan 26, 2018 | Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Autonomous UAV Exploration of Dynamic Environments via Incremental Sampling and Probabilistic Roadmap | Oct 14, 2020 | Safe Exploration | CodeCode Available | 1 |
| Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Sep 29, 2020 | General Reinforcement LearningMinecraft | CodeCode Available | 1 |
| Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics | Jul 21, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Avoiding Negative Side-Effects and Promoting Safe Exploration with Imaginative Planning | Sep 25, 2019 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Ablation Study of How Run Time Assurance Impacts the Training and Performance of Reinforcement Learning Agents | Jul 8, 2022 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Model-Assisted Probabilistic Safe Adaptive Control With Meta-Bayesian Learning | Jul 3, 2023 | Meta-LearningSafe Exploration | —Unverified | 0 |
| Conservative Safety Critics for Exploration | Oct 27, 2020 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| MESA: Offline Meta-RL for Safe Adaptation and Fault Tolerance | Dec 7, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Model-Based Offline Meta-Reinforcement Learning with Regularization | Feb 7, 2022 | Meta Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors | Feb 25, 2023 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 |
| Contextual Affordances for Safe Exploration in Robotic Scenarios | May 10, 2024 | Safe Exploration | —Unverified | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning | Aug 15, 2024 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| Data Efficient Reinforcement Learning for Legged Robots | Jul 8, 2019 | Model Predictive Controlreinforcement-learning | —Unverified | 0 |
| Data-efficient visuomotor policy training using reinforcement learning and generative models | Jul 26, 2020 | Decision MakingDisentanglement | —Unverified | 0 |
| Decoupled Learning of Environment Characteristics for Safe Exploration | Aug 9, 2017 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention | Oct 27, 2021 | OpenAI Gymreinforcement-learning | —Unverified | 0 |
| Learning to Drive Using Sparse Imitation Reinforcement Learning | May 24, 2022 | Autonomous Drivingreinforcement-learning | —Unverified | 0 |
| Learning Transferable Domain Priors for Safe Exploration in Reinforcement Learning | Sep 10, 2019 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning | Oct 12, 2024 | Efficient Explorationreinforcement-learning | —Unverified | 0 |
| Chance-Constrained Trajectory Optimization for Safe Exploration and Learning of Nonlinear Systems | May 9, 2020 | Motion PlanningOptimal Motion Planning | —Unverified | 0 |
| Learning to Control Highly Accelerated Ballistic Movements on Muscular Robots | Apr 7, 2019 | Bayesian OptimizationSafe Exploration | —Unverified | 0 |
| Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing | May 5, 2022 | Autonomous DrivingAutonomous Racing | —Unverified | 0 |
| Building HVAC Scheduling Using Reinforcement Learning via Neural Network Based Model Approximation | Oct 11, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Exploration of Unranked Items in Safe Online Learning to Re-Rank | May 2, 2023 | Learning-To-RankSafe Exploration | —Unverified | 0 |
| Learning-Enhanced Safeguard Control for High-Relative-Degree Systems: Robust Optimization under Disturbances and Faults | Jan 26, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Guiding Safe Exploration with Weakest Preconditions | Sep 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| A Safe Self-evolution Algorithm for Autonomous Driving Based on Data-Driven Risk Quantification Model | Aug 23, 2024 | Autonomous DrivingEvolutionary Algorithms | —Unverified | 0 |
| Highway Value Iteration Networks | Jun 5, 2024 | DiversitySafe Exploration | —Unverified | 0 |
| A Safe Semi-supervised Graph Convolution Network | Jul 5, 2022 | Safe Exploration | —Unverified | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Exploration in Deep Reinforcement Learning: A Survey | May 2, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A safe exploration approach to constrained Markov decision processes | Dec 1, 2023 | reinforcement-learningSafe Exploration | —Unverified | 0 |
| BubbleRank: Safe Online Learning to Re-Rank via Implicit Click Feedback | Jun 15, 2018 | Learning-To-RankRe-Ranking | —Unverified | 0 |
| Approximate Shielding of Atari Agents for Safe Exploration | Apr 21, 2023 | Atari GamesSafe Exploration | —Unverified | 0 |
| Learning-based Symbolic Abstractions for Nonlinear Control Systems | Apr 4, 2020 | Safe Exploration | —Unverified | 0 |
| Learning Human-like Representations to Enable Learning Human Values | Dec 21, 2023 | EthicsFairness | —Unverified | 0 |