SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1200112050 of 15113 papers

TitleStatusHype
Run Time Assured Reinforcement Learning for Six Degree-of-Freedom Spacecraft Inspection0
Runtime Safety Assurance Using Reinforcement Learning0
Runtime Verification of Learning Properties for Reinforcement Learning Algorithms0
S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning?0
S2VG: Soft Stochastic Value Gradient method0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning0
SAAC: Safe Reinforcement Learning as an Adversarial Game of Actor-Critics0
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling0
Random Policy Enables In-Context Reinforcement Learning within Trust Horizons0
Safe and Psychologically Pleasant Traffic Signal Control with Reinforcement Learning using Action Masking0
Safe and Robust Reinforcement Learning: Principles and Practice0
Safe Approximate Dynamic Programming Via Kernelized Lipschitz Estimation0
Safe Continual Domain Adaptation after Sim2Real Transfer of Reinforcement Learning Policies in Robotics0
Safe Control and Learning Using the Generalized Action Governor0
Safe Controller for Output Feedback Linear Systems using Model-Based Reinforcement Learning0
Debiased Off-Policy Evaluation for Recommendation Systems0
Safe Coupled Deep Q-Learning for Recommendation Systems0
Safety-Critical Learning of Robot Control with Temporal Logic Specifications0
Safe Decision-making for Lane-change of Autonomous Vehicles via Human Demonstration-aided Reinforcement Learning0
Safe deep reinforcement learning-based constrained optimal control scheme for active distribution networks0
Safe Deep Reinforcement Learning by Verifying Task-Level Properties0
Safe Distributional Reinforcement Learning0
Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation0
Safe Evaluation For Offline Learning: Are We Ready To Deploy?0
Safe Exploration by Solving Early Terminated MDP0
Safe Exploration for Identifying Linear Systems via Robust Optimization0
Safe Exploration in Linear Equality Constraint0
Safe Exploration in Model-based Reinforcement Learning using Control Barrier Functions0
Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations0
Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms0
A predictive safety filter for learning-based control of constrained nonlinear dynamical systems0
Safe Exploration of State and Action Spaces in Reinforcement Learning0
Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis0
Safe Inverse Reinforcement Learning via Control Barrier Function0
Safe Learning and Optimization Techniques: Towards a Survey of the State of the Art0
Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles0
Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions0
Safe multi-agent deep reinforcement learning for joint bidding and maintenance scheduling of generation units0
Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving0
Safe Multi-Agent Reinforcement Learning via Shielding0
Safe Planning and Policy Optimization via World Model Learning0
Safe Policies for Reinforcement Learning via Primal-Dual Methods0
Safe Policy Improvement for POMDPs via Finite-State Controllers0
Safe Policy Improvement in Constrained Markov Decision Processes0
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret0
Safe RAN control: A Symbolic Reinforcement Learning Approach0
SAFER: Data-Efficient and Safe Reinforcement Learning Through Skill Acquisition0
SAFER: Data-Efficient and Safe Reinforcement Learning via Skill Acquisition0
Safer Deep RL with Shallow MCTS: A Case Study in Pommerman0
SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning0
Show:102550
← PrevPage 241 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified