SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 56515675 of 15113 papers

TitleStatusHype
Safety aware model-based reinforcement learning for optimal control of a class of output-feedback nonlinear systems0
Safety-Aware Multi-Agent Apprenticeship Learning0
Safe Autonomous Racing via Approximate Reachability on Ego-vision0
Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network0
Safety-Aware Reinforcement Learning for Control via Risk-Sensitive Action-Value Iteration and Quantile Regression0
Safety Aware Reinforcement Learning (SARL)0
Safety-Aware Task Composition for Discrete and Continuous Reinforcement Learning0
Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics via Dual-agent Reinforcement Learning0
Safety-Enhanced Self-Learning for Optimal Power Converter Control0
Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance0
Safety Filtering for Reinforcement Learning-based Adaptive Cruise Control0
Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine0
Safety-Guided Deep Reinforcement Learning via Online Gaussian Process Estimation0
Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies0
Safety through Permissibility: Shield Construction for Fast and Safe Reinforcement Learning0
Black-Box Safety Validation of Autonomous Systems: A Multi-Fidelity Reinforcement Learning Approach0
Safety Verification of Model Based Reinforcement Learning Controllers0
SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning0
SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes0
SAINT-ACC: Safety-Aware Intelligent Adaptive Cruise Control for Autonomous Vehicles Using Deep Reinforcement Learning0
Saliency-based Sequential Image Attention with Multiset Prediction0
SaLinA: Sequential Learning of Agents0
SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance0
Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions0
Sample-based Distributional Policy Gradient0
Show:102550
← PrevPage 227 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified