SOTAVerified

Safe Reinforcement Learning

Papers

Showing 101150 of 306 papers

TitleStatusHype
Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning0
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning0
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning0
Computationally Efficient Safe Reinforcement Learning for Power Systems0
Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey0
Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents0
Learning to Be Cautious0
Learning to explore when mistakes are not allowed0
Learning to Recover for Safe Reinforcement Learning0
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering0
FOSP: Fine-tuning Offline Safe Policy through World Models0
Coordinated Frequency Control through Safe Reinforcement Learning0
A Provable Approach for End-to-End Safe Reinforcement Learning0
Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving0
Long-term Safe Reinforcement Learning with Binary Feedback0
Lyapunov-based uncertainty-aware safe reinforcement learning0
Make Safe Decisions in Power System: Safe Reinforcement Learning Based Pre-decision Making for Voltage Stability Emergency Control0
Data Generation Method for Learning a Low-dimensional Safe Region in Safe Reinforcement Learning0
Adaptive Primal-Dual Method for Safe Reinforcement Learning0
Penalizing side effects using stepwise relative reachability0
Flipping-based Policy for Chance-Constrained Markov Decision Processes0
Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning0
Feasible Policy Iteration for Safe Reinforcement Learning0
CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving0
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles0
Modeling Risk in Reinforcement Learning: A Literature Mapping0
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee0
Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea0
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
Case Study: Verifying the Safety of an Autonomous Racing Car with a Neural Network Controller0
A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis0
Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach0
Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes0
Probabilistic Shielding for Safe Reinforcement Learning0
Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning0
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement Learning0
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments0
Optimal Transport-Assisted Risk-Sensitive Q-Learning0
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies0
Bresa: Bio-inspired Reflexive Safe Reinforcement Learning for Contact-Rich Robotic Tasks0
Adaptive Aggregation for Safety-Critical Control0
A Critical Review of Safe Reinforcement Learning Techniques in Smart Grid Applications0
Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning0
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning0
Parenting: Safe Reinforcement Learning from Human Input0
Penalized Proximal Policy Optimization for Safe Reinforcement Learning0
Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning0
On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions0
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.