| Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models | Mar 22, 2025 | MisinformationSafe Reinforcement Learning | —Unverified | 0 |
| Reachable Sets-based Trajectory Planning Combining Reinforcement Learning and iLQR | Mar 19, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation | Mar 15, 2025 | Hierarchical Reinforcement LearningMotion Planning | —Unverified | 0 |
| Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning | Mar 13, 2025 | Contrastive LearningRepresentation Learning | —Unverified | 0 |
| HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents | Mar 11, 2025 | NavigateReinforcement Learning (RL) | —Unverified | 0 |
| Probabilistic Shielding for Safe Reinforcement Learning | Mar 9, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning | Mar 5, 2025 | Safe Reinforcement LearningSafety Alignment | —Unverified | 0 |
| Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning | Feb 25, 2025 | BenchmarkingReinforcement Learning (RL) | CodeCode Available | 0 |
| Risk-Averse Reinforcement Learning: An Optimal Transport Perspective on Temporal Difference Learning | Feb 22, 2025 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Learning to explore when mistakes are not allowed | Feb 19, 2025 | Safe ExplorationSafe Reinforcement Learning | —Unverified | 0 |