SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 55765600 of 15113 papers

TitleStatusHype
Safe Control and Learning Using the Generalized Action Governor0
Safe Controller for Output Feedback Linear Systems using Model-Based Reinforcement Learning0
Debiased Off-Policy Evaluation for Recommendation Systems0
Safe Coupled Deep Q-Learning for Recommendation Systems0
Safety-Critical Learning of Robot Control with Temporal Logic Specifications0
Safe Decision-making for Lane-change of Autonomous Vehicles via Human Demonstration-aided Reinforcement Learning0
Safe deep reinforcement learning-based constrained optimal control scheme for active distribution networks0
Safe Deep Reinforcement Learning by Verifying Task-Level Properties0
Safe Distributional Reinforcement Learning0
Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation0
Safe Evaluation For Offline Learning: Are We Ready To Deploy?0
Safe Exploration by Solving Early Terminated MDP0
Safe Exploration for Identifying Linear Systems via Robust Optimization0
Safe Exploration in Linear Equality Constraint0
Safe Exploration in Model-based Reinforcement Learning using Control Barrier Functions0
Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations0
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms0
A predictive safety filter for learning-based control of constrained nonlinear dynamical systems0
Safe Exploration of State and Action Spaces in Reinforcement Learning0
Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis0
Safe Inverse Reinforcement Learning via Control Barrier Function0
Safe Learning and Optimization Techniques: Towards a Survey of the State of the Art0
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles0
Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions0
Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units0
Show:102550
← PrevPage 224 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified