SOTAVerified

Safe Reinforcement Learning

Papers

Showing 126150 of 306 papers

TitleStatusHype
Modeling Risk in Reinforcement Learning: A Literature Mapping0
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee0
Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea0
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
Case Study: Verifying the Safety of an Autonomous Racing Car with a Neural Network Controller0
A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis0
Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach0
Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes0
Probabilistic Shielding for Safe Reinforcement Learning0
Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning0
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement Learning0
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments0
Optimal Transport-Assisted Risk-Sensitive Q-Learning0
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies0
Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks0
Adaptive Aggregation for Safety-Critical Control0
A Critical Review of Safe Reinforcement Learning Techniques in Smart Grid Applications0
Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning0
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning0
Parenting: Safe Reinforcement Learning from Human Input0
Penalized Proximal Policy Optimization for Safe Reinforcement Learning0
Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning0
On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions0
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk0
Show:102550
← PrevPage 6 of 13Next →

No leaderboard results yet.