SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 55015525 of 15113 papers

TitleStatusHype
Robust Log-Optimal Strategy with Reinforcement Learning0
Robust Longitudinal Control for Vehicular Autonomous Platoons Using Deep Reinforcement Learning0
Robust Meta-Reinforcement Learning with Curriculum-Based Task Sampling0
Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control0
Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics0
Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium0
Robust Multi-Agent Reinforcement Learning with Model Uncertainty0
Robust Multimodal Image Registration Using Deep Recurrent Reinforcement Learning0
Robust N-1 secure HV Grid Flexibility Estimation for TSO-DSO coordinated Congestion Management with Deep Reinforcement Learning0
Robustness and risk management via distributional dynamic programming0
Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning0
Robust Offline Reinforcement Learning -- Certify the Confidence Interval0
Robust Offline Reinforcement Learning for Non-Markovian Decision Processes0
Robust Offline Reinforcement Learning from Low-Quality Data0
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation0
Robust off-policy Reinforcement Learning via Soft Constrained Adversary0
Robust Opponent Modeling via Adversarial Ensemble Reinforcement Learning in Asymmetric Imperfect-Information Games0
Robust Path Selection in Software-defined WANs using Deep Reinforcement Learning0
Robust Policy Learning over Multiple Uncertainty Sets0
Robust Policy Learning via Offline Skill Diffusion0
Robust Policy Switching for Antifragile Reinforcement Learning for UAV Deconfliction in Adversarial Environments0
Robust Predictable Control0
Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning0
Robust Quadruped Jumping via Deep Reinforcement Learning0
Robust Recovery Controller for a Quadrupedal Robot using Deep Reinforcement Learning0
Show:102550
← PrevPage 221 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified