SOTAVerified

Safe Reinforcement Learning

Papers

Showing 251300 of 306 papers

TitleStatusHype
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems0
Directed Policy Gradient for Safe Reinforcement Learning with Human Advice0
Distributionally Safe Reinforcement Learning under Model Uncertainty: A Single-Level Approach by Differentiable Convex Programming0
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning0
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning0
Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning0
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk0
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments0
Enhance Exploration in Safe Reinforcement Learning with Contrastive Representation Learning0
Ensuring Safety in Target Pursuit Control: A CBF-Safe Reinforcement Learning Approach0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
Feasible Policy Iteration for Safe Reinforcement Learning0
Flipping-based Policy for Chance-Constrained Markov Decision Processes0
FOSP: Fine-tuning Offline Safe Policy through World Models0
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning0
Fully Bayesian Recurrent Neural Networks for Safe Reinforcement Learning0
GenSafe: A Generalizable Safety Enhancer for Safe Reinforcement Learning Algorithms Based on Reduced Order Markov Decision Process Model0
Gradient Shaping for Multi-Constraint Safe Reinforcement Learning0
Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration0
HASARD: A Benchmark for Vision-Based Safe Reinforcement Learning in Embodied Agents0
Hierarchical Reinforcement Learning for Safe Mapless Navigation with Congestion Estimation0
Lagrangian-based online safe reinforcement learning for state-constrained systems0
Implicit Safe Set Algorithm for Provably Safe Reinforcement Learning0
Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism0
Iterative Reachability Estimation for Safe Reinforcement Learning0
Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning0
Learning a Shield from Catastrophic Action Effects: Never Repeat the Same Mistake0
Learning Control Barrier Functions and their application in Reinforcement Learning: A Survey0
Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents0
Learning to Be Cautious0
Learning to explore when mistakes are not allowed0
Learning to Recover for Safe Reinforcement Learning0
LMPriors: Pre-Trained Language Models as Task-Specific Priors0
Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving0
Long-term Safe Reinforcement Learning with Binary Feedback0
Lyapunov-based uncertainty-aware safe reinforcement learning0
Make Safe Decisions in Power System: Safe Reinforcement Learning Based Pre-decision Making for Voltage Stability Emergency Control0
E^2CFD: Towards Effective and Efficient Cost Function Design for Safe Reinforcement Learning via Large Language Model0
Penalizing side effects using stepwise relative reachability0
Meta SAC-Lag: Towards Deployable Safe Reinforcement Learning via MetaGradient-based Hyperparameter Tuning0
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles0
Modeling Risk in Reinforcement Learning: A Literature Mapping0
Uniformly Safe RL with Objective Suppression for Multi-Constraint Safety-Critical Applications0
Techno-Economic Modeling and Safe Operational Optimization of Multi-Network Constrained Integrated Community Energy Systems0
Multi-task Safe Reinforcement Learning for Navigating Intersections in Dense Traffic0
Networked Multiagent Safe Reinforcement Learning for Low-carbon Demand Management in Distribution Network0
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies0
On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process0
On Reward Structures of Markov Decision Processes0
On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions0
Show:102550
← PrevPage 6 of 7Next →

No leaderboard results yet.