| Reinforcement Learning from Human Feedback with High-Confidence Safety Constraints | Jun 9, 2025 | Safe Reinforcement Learning | CodeCode Available | 0 |
| Provably Safe Reinforcement Learning from Analytic Gradients | Jun 2, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| A Provable Approach for End-to-End Safe Reinforcement Learning | May 28, 2025 | Gaussian ProcessesReinforcement Learning (RL) | —Unverified | 0 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 |
| Risk-Aware Safe Reinforcement Learning for Control of Stochastic Linear Systems | May 14, 2025 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RLSafe Reinforcement Learning | —Unverified | 0 |
| Skill-based Safe Reinforcement Learning with Risk Planning | May 2, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Designing Control Barrier Function via Probabilistic Enumeration for Safe Reinforcement Learning Navigation | Apr 30, 2025 | Autonomous NavigationSafe Reinforcement Learning | —Unverified | 0 |
| Anytime Safe Reinforcement Learning | Apr 23, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback | Apr 17, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents | Apr 4, 2025 | Safe Reinforcement Learning | —Unverified | 0 |
| Safety Modulation: Enhancing Safety in Reinforcement Learning through Cost-Modulated Rewards | Apr 3, 2025 | Safe Reinforcement Learning | —Unverified | 0 |
| Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks | Mar 27, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models | Mar 22, 2025 | MisinformationSafe Reinforcement Learning | —Unverified | 0 |
| Reachable Sets-based Trajectory Planning Combining Reinforcement Learning and iLQR | Mar 19, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation | Mar 15, 2025 | Hierarchical Reinforcement LearningMotion Planning | —Unverified | 0 |
| Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning | Mar 13, 2025 | Contrastive LearningRepresentation Learning | —Unverified | 0 |
| HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents | Mar 11, 2025 | NavigateReinforcement Learning (RL) | —Unverified | 0 |
| Probabilistic Shielding for Safe Reinforcement Learning | Mar 9, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning | Mar 5, 2025 | Safe Reinforcement LearningSafety Alignment | —Unverified | 0 |
| Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning | Feb 25, 2025 | BenchmarkingReinforcement Learning (RL) | CodeCode Available | 0 |
| Risk-Averse Reinforcement Learning: An Optimal Transport Perspective on Temporal Difference Learning | Feb 22, 2025 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning-based Control for Hydrogen Diesel Dual-Fuel Engines | Feb 13, 2025 | Model Predictive ControlReinforcement Learning (RL) | —Unverified | 0 |
| DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems | Jan 30, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 |
| Safe Reinforcement Learning for Real-World Engine Control | Jan 28, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning with Minimal Supervision | Jan 8, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Safe Multiagent Coordination via Entropic Exploration | Dec 29, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning | Dec 25, 2024 | Decision MakingOffline RL | CodeCode Available | 1 |
| Offline Safe Reinforcement Learning Using Trajectory Classification | Dec 19, 2024 | Classificationreinforcement-learning | CodeCode Available | 0 |
| Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning | Dec 17, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning | Dec 17, 2024 | Formreinforcement-learning | CodeCode Available | 0 |
| Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation | Dec 15, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning | Dec 12, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Dec 11, 2024 | Autonomous DrivingOffline RL | CodeCode Available | 0 |
| Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy | Dec 5, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks | Dec 2, 2024 | energy managementIn-Context Learning | —Unverified | 0 |
| Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach | Nov 26, 2024 | Collision Avoidancereinforcement-learning | —Unverified | 0 |
| Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement Learning | Nov 7, 2024 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning | Oct 31, 2024 | Meta-LearningMinecraft | —Unverified | 0 |
| Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control | Oct 19, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism | Oct 14, 2024 | Safe Reinforcement Learning | —Unverified | 0 |
| Flipping-based Policy for Chance-Constrained Markov Decision Processes | Oct 9, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering | Oct 9, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Realizable Continuous-Space Shields for Safe Reinforcement Learning | Oct 2, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Critical Review of Safe Reinforcement Learning Techniques in Smart Grid Applications | Sep 24, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning | Sep 18, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies | Sep 16, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning | Sep 12, 2024 | Decision MakingManagement | —Unverified | 0 |
| Revisiting Safe Exploration in Safe Reinforcement learning | Sep 2, 2024 | Benchmarkingreinforcement-learning | —Unverified | 0 |