SOTAVerified

Safe Reinforcement Learning

Papers

Showing 150 of 306 papers

TitleStatusHype
Reinforcement Learning from Human Feedback with High-Confidence Safety ConstraintsCode0
Provably Safe Reinforcement Learning from Analytic Gradients0
A Provable Approach for End-to-End Safe Reinforcement Learning0
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies0
Risk-Aware Safe Reinforcement Learning for Control of Stochastic Linear Systems0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
Skill-based Safe Reinforcement Learning with Risk Planning0
Designing Control Barrier Function via Probabilistic Enumeration for Safe Reinforcement Learning Navigation0
Anytime Safe Reinforcement Learning0
TraCeS: Trajectory Based Credit Assignment From Sparse Safety Feedback0
Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents0
Safety Modulation: Enhancing Safety in Reinforcement Learning through Cost-Modulated Rewards0
Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks0
Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models0
Reachable Sets-based Trajectory Planning Combining Reinforcement Learning and iLQR0
Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation0
Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning0
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents0
Probabilistic Shielding for Safe Reinforcement Learning0
SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning0
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement LearningCode0
Risk-Averse Reinforcement Learning: An Optimal Transport Perspective on Temporal Difference LearningCode0
Learning to explore when mistakes are not allowed0
Safe Reinforcement Learning-based Control for Hydrogen Diesel Dual-Fuel Engines0
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems0
Safe Reinforcement Learning for Real-World Engine Control0
Safe Reinforcement Learning with Minimal Supervision0
Safe Multiagent Coordination via Entropic Exploration0
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement LearningCode1
Offline Safe Reinforcement Learning Using Trajectory ClassificationCode0
Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning0
Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement LearningCode0
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation0
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning0
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement LearningCode0
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline PolicyCode0
RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks0
Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach0
Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement LearningCode0
Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning0
Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control0
Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism0
Flipping-based Policy for Chance-Constrained Markov Decision Processes0
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering0
Realizable Continuous-Space Shields for Safe Reinforcement Learning0
A Critical Review of Safe Reinforcement Learning Techniques in Smart Grid Applications0
Handling Long-Term Safety and Uncertainty in Safe Reinforcement LearningCode0
Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies0
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning0
Revisiting Safe Exploration in Safe Reinforcement learning0
Show:102550
← PrevPage 1 of 7Next →

No leaderboard results yet.