| Reinforcement Learning from Human Feedback with High-Confidence Safety Constraints | Jun 9, 2025 | Safe Reinforcement Learning | CodeCode Available | 0 |
| Provably Safe Reinforcement Learning from Analytic Gradients | Jun 2, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| A Provable Approach for End-to-End Safe Reinforcement Learning | May 28, 2025 | Gaussian ProcessesReinforcement Learning (RL) | —Unverified | 0 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 |
| Risk-Aware Safe Reinforcement Learning for Control of Stochastic Linear Systems | May 14, 2025 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RLSafe Reinforcement Learning | —Unverified | 0 |
| Skill-based Safe Reinforcement Learning with Risk Planning | May 2, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Designing Control Barrier Function via Probabilistic Enumeration for Safe Reinforcement Learning Navigation | Apr 30, 2025 | Autonomous NavigationSafe Reinforcement Learning | —Unverified | 0 |
| Anytime Safe Reinforcement Learning | Apr 23, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback | Apr 17, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents | Apr 4, 2025 | Safe Reinforcement Learning | —Unverified | 0 |
| Safety Modulation: Enhancing Safety in Reinforcement Learning through Cost-Modulated Rewards | Apr 3, 2025 | Safe Reinforcement Learning | —Unverified | 0 |
| Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks | Mar 27, 2025 | Reinforcement Learning (RL)Safe Exploration | —Unverified | 0 |
| Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models | Mar 22, 2025 | MisinformationSafe Reinforcement Learning | —Unverified | 0 |
| Reachable Sets-based Trajectory Planning Combining Reinforcement Learning and iLQR | Mar 19, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation | Mar 15, 2025 | Hierarchical Reinforcement LearningMotion Planning | —Unverified | 0 |
| Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning | Mar 13, 2025 | Contrastive LearningRepresentation Learning | —Unverified | 0 |
| HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents | Mar 11, 2025 | NavigateReinforcement Learning (RL) | —Unverified | 0 |
| Probabilistic Shielding for Safe Reinforcement Learning | Mar 9, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning | Mar 5, 2025 | Safe Reinforcement LearningSafety Alignment | —Unverified | 0 |
| Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning | Feb 25, 2025 | BenchmarkingReinforcement Learning (RL) | CodeCode Available | 0 |
| Risk-Averse Reinforcement Learning: An Optimal Transport Perspective on Temporal Difference Learning | Feb 22, 2025 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |
| Safe Reinforcement Learning-based Control for Hydrogen Diesel Dual-Fuel Engines | Feb 13, 2025 | Model Predictive ControlReinforcement Learning (RL) | —Unverified | 0 |
| DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems | Jan 30, 2025 | Autonomous DrivingImitation Learning | —Unverified | 0 |