SOTAVerified

Safe Reinforcement Learning

Papers

Showing 251300 of 306 papers

TitleStatusHype
Safe Reinforcement Learning of Dynamic High-Dimensional Robotic Tasks: Navigation, Manipulation, Interaction0
Safe Reinforcement Learning on Autonomous Vehicles0
Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications0
Safe Reinforcement Learning through Meta-learned Instincts0
Safe Reinforcement Learning using Data-Driven Predictive Control0
Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation0
Safe Reinforcement Learning Using Robust Action Governor0
Safe Reinforcement Learning via Confidence-Based Filters0
Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?0
Safe Reinforcement Learning via Shielding under Partial Observability0
Safe Reinforcement Learning with Probabilistic Control Barrier Functions for Ramp Merging0
Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration0
Safe Reinforcement Learning with Chance-constrained Model Predictive Control0
Safe Reinforcement Learning with Contrastive Risk Prediction0
Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery0
ROSARL: Reward-Only Safe Reinforcement LearningCode0
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement LearningCode0
Trial without Error: Towards Safe Reinforcement Learning via Human InterventionCode0
Risk-Averse Reinforcement Learning: An Optimal Transport Perspective on Temporal Difference LearningCode0
Real-world validation of safe reinforcement learning, model predictive control and decision tree-based home energy management systemsCode0
DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement LearningCode0
Safe reinforcement learning in uncertain contextsCode0
Safe Reinforcement Learning of Control-Affine Systems with Vertex NetworksCode0
Reinforcement Learning from Human Feedback with High-Confidence Safety ConstraintsCode0
Offline Safe Reinforcement Learning Using Trajectory ClassificationCode0
SafeLight: A Reinforcement Learning Method toward Collision-free Traffic Signal ControlCode0
Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline PolicyCode0
Log Barriers for Safe Black-box Optimization with Application to Safe Reinforcement LearningCode0
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous EnvironmentsCode0
Specialized Deep Residual Policy Safe Reinforcement Learning-Based Controller for Complex and Continuous State-Action SpacesCode0
Convergent Policy Optimization for Safe Reinforcement LearningCode0
Constrained Policy OptimizationCode0
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement LearningCode0
How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature ReviewCode0
Safe Reinforcement Learning Using Black-Box Reachability AnalysisCode0
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement LearningCode0
Probabilistic Constrained Reinforcement Learning with Formal InterpretabilityCode0
Better Safe than Sorry: Evidence Accumulation Allows for Safe Reinforcement LearningCode0
Safe Reinforcement Learning with Scene Decomposition for Navigating Complex Urban EnvironmentsCode0
Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement LearningCode0
Explainable and Safe Reinforcement Learning for Autonomous Air MobilityCode0
Handling Long-Term Safety and Uncertainty in Safe Reinforcement LearningCode0
Safe Reinforcement Learning with Nonlinear Dynamics via Model Predictive ShieldingCode0
Safe Reinforcement Learning via Probabilistic Logic ShieldsCode0
End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control TasksCode0
Safe Reinforcement Learning via ShieldingCode0
From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety SafeguardsCode0
A Lyapunov-based Approach to Safe Reinforcement LearningCode0
Approximate Model-Based Shielding for Safe Reinforcement LearningCode0
Think Smart, Act SMARL! Analyzing Probabilistic Logic Shields for Multi-Agent Reinforcement LearningCode0
Show:102550
← PrevPage 6 of 7Next →

No leaderboard results yet.