| ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy | Feb 8, 2025 | Q-LearningSafe Exploration | CodeCode Available | 3 |
| MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning | Sep 26, 2021 | BenchmarkingDecision Making | CodeCode Available | 2 |
| Transductive Active Learning with Application to Safe Bayesian Optimization | Jul 12, 2024 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| State-Wise Safe Reinforcement Learning With Pixel Observations | Nov 3, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Near-Optimal Multi-Agent Learning for Safe Coverage Control | Oct 12, 2022 | DiversityNavigate | CodeCode Available | 1 |
| Toward Safe and Accelerated Deep Reinforcement Learning for Next-Generation Wireless Networks | Sep 16, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety | May 22, 2021 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Autonomous UAV Exploration of Dynamic Environments via Incremental Sampling and Probabilistic Roadmap | Oct 14, 2020 | Safe Exploration | CodeCode Available | 1 |
| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Sep 29, 2020 | General Reinforcement LearningMinecraft | CodeCode Available | 1 |
| Neurosymbolic Reinforcement Learning with Formally Verified Exploration | Sep 26, 2020 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Provably Safe PAC-MDP Exploration Using Analogies | Jul 7, 2020 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Verifiably Safe Exploration for End-to-End Reinforcement Learning | Jul 2, 2020 | Deep Reinforcement Learningobject-detection | CodeCode Available | 1 |
| SafeML: Safety Monitoring of Machine Learning Classifiers through Statistical Difference Measure | May 27, 2020 | BIG-bench Machine LearningDomain Adaptation | CodeCode Available | 1 |
| Safe Exploration in Continuous Action Spaces | Jan 26, 2018 | Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL | Jun 4, 2025 | DisentanglementIndustrial Robots | —Unverified | 0 |
| Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving | Jun 4, 2025 | Autonomous DrivingImitation Learning | CodeCode Available | 0 |
| Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks | Mar 27, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Safe exploration in reproducing kernel Hilbert spaces | Mar 13, 2025 | Bayesian OptimizationSafe Exploration | —Unverified | 0 |
| Safety Representations for Safer Policy Learning | Feb 27, 2025 | Safe Exploration | —Unverified | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| Learning-Enhanced Safeguard Control for High-Relative-Degree Systems: Robust Optimization under Disturbances and Faults | Jan 26, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Safe Bayesian Optimization for the Control of High-Dimensional Embodied Systems | Dec 29, 2024 | Bayesian OptimizationHumanoid Control | —Unverified | 0 |
| ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning | Oct 12, 2024 | Efficient Explorationreinforcement-learning | —Unverified | 0 |
| Robust Deep Reinforcement Learning for Volt-VAR Optimization in Active Distribution System under Uncertainty | Sep 27, 2024 | Conformal PredictionDeep Reinforcement Learning | —Unverified | 0 |