SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 31413150 of 15113 papers

TitleStatusHype
IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of InterestingnessCode0
REX: Rapid Exploration and eXploitation for AI Agents0
Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees0
Quarl: A Learning-Based Quantum Circuit Optimizer0
Natural Actor-Critic for Robust Reinforcement Learning with Function ApproximationCode1
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient0
Basal-Bolus Advisor for Type 1 Diabetes (T1D) Patients Using Multi-Agent Reinforcement Learning (RL) Methodology0
Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning0
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning0
POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenanceCode0
Show:102550
← PrevPage 315 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified