| SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL | Jun 4, 2025 | DisentanglementIndustrial Robots | —Unverified | 0 |
| Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving | Jun 4, 2025 | Autonomous DrivingImitation Learning | CodeCode Available | 0 |
| Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks | Mar 27, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Safe exploration in reproducing kernel Hilbert spaces | Mar 13, 2025 | Bayesian OptimizationSafe Exploration | —Unverified | 0 |
| Safety Representations for Safer Policy Learning | Feb 27, 2025 | Safe Exploration | —Unverified | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy | Feb 8, 2025 | Q-LearningSafe Exploration | CodeCode Available | 3 |
| Learning-Enhanced Safeguard Control for High-Relative-Degree Systems: Robust Optimization under Disturbances and Faults | Jan 26, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Safe Bayesian Optimization for the Control of High-Dimensional Embodied Systems | Dec 29, 2024 | Bayesian OptimizationHumanoid Control | —Unverified | 0 |
| ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning | Oct 12, 2024 | Efficient Explorationreinforcement-learning | —Unverified | 0 |
| Robust Deep Reinforcement Learning for Volt-VAR Optimization in Active Distribution System under Uncertainty | Sep 27, 2024 | Conformal PredictionDeep Reinforcement Learning | —Unverified | 0 |
| Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning | Sep 18, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Revisiting Safe Exploration in Safe Reinforcement learning | Sep 2, 2024 | Benchmarkingreinforcement-learning | —Unverified | 0 |
| A Safe Self-evolution Algorithm for Autonomous Driving Based on Data-Driven Risk Quantification Model | Aug 23, 2024 | Autonomous DrivingEvolutionary Algorithms | —Unverified | 0 |
| Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning | Aug 15, 2024 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| A Safe Exploration Strategy for Model-free Task Adaptation in Safety-constrained Grid Environments | Aug 2, 2024 | Binary ClassificationSafe Exploration | —Unverified | 0 |
| Exterior Penalty Policy Optimization with Penalty Metric Network under Constraints | Jul 22, 2024 | Safe Exploration | CodeCode Available | 0 |
| Transductive Active Learning with Application to Safe Bayesian Optimization | Jul 12, 2024 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| Enhanced Safety in Autonomous Driving: Integrating Latent State Diffusion Model for End-to-End Navigation | Jul 8, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Highway Value Iteration Networks | Jun 5, 2024 | DiversitySafe Exploration | —Unverified | 0 |
| Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding | May 28, 2024 | reinforcement-learningReinforcement Learning (RL) | CodeCode Available | 0 |
| Preparing for Black Swans: The Antifragility Imperative for Machine Learning | May 18, 2024 | Continual LearningDecision Making | —Unverified | 0 |
| Contextual Affordances for Safe Exploration in Robotic Scenarios | May 10, 2024 | Safe Exploration | —Unverified | 0 |
| Safe Exploration Using Bayesian World Models and Log-Barrier Optimization | May 9, 2024 | Safe Exploration | —Unverified | 0 |
| Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | Feb 23, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| A comparison of RL-based and PID controllers for 6-DOF swimming robots: hybrid underwater object tracking | Jan 29, 2024 | Object TrackingSafe Exploration | CodeCode Available | 0 |
| Towards Socially and Morally Aware RL agent: Reward Design With LLM | Jan 23, 2024 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning | Jan 10, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning Human-like Representations to Enable Learning Human Values | Dec 21, 2023 | EthicsFairness | —Unverified | 0 |
| Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations | Dec 13, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| A safe exploration approach to constrained Markov decision processes | Dec 1, 2023 | reinforcement-learningSafe Exploration | —Unverified | 0 |
| Safe Reinforcement Learning in a Simulated Robotic Arm | Nov 28, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| State-Wise Safe Reinforcement Learning With Pixel Observations | Nov 3, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 |
| Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms | Oct 5, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Reinforcement Learning by Guided Safe Exploration | Jul 26, 2023 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Probabilistic Counterexample Guidance for Safer Reinforcement Learning (Extended Version) | Jul 10, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Model-Assisted Probabilistic Safe Adaptive Control With Meta-Bayesian Learning | Jul 3, 2023 | Meta-LearningSafe Exploration | —Unverified | 0 |
| Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery | Jun 24, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Exploration of Unranked Items in Safe Online Learning to Re-Rank | May 2, 2023 | Learning-To-RankSafe Exploration | —Unverified | 0 |
| System III: Learning with Domain Knowledge for Safety Constraints | Apr 23, 2023 | Safe Exploration | —Unverified | 0 |
| Approximate Shielding of Atari Agents for Safe Exploration | Apr 21, 2023 | Atari GamesSafe Exploration | —Unverified | 0 |
| Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments | Mar 24, 2023 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors | Feb 25, 2023 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 |
| Information-Theoretic Safe Exploration with Gaussian Processes | Dec 9, 2022 | Decision MakingGaussian Processes | CodeCode Available | 0 |
| Benefits of Monotonicity in Safe Exploration with Gaussian Processes | Nov 3, 2022 | Gaussian ProcessesSafe Exploration | CodeCode Available | 0 |
| Atlas: Automate Online Service Configuration in Network Slicing | Oct 30, 2022 | Bayesian OptimizationSafe Exploration | CodeCode Available | 0 |
| The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning | Oct 20, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm | Oct 14, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |