SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 15611570 of 15113 papers

TitleStatusHype
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm0
Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning ApproachCode2
Energy Saving in 6G O-RAN Using DQN-based xApp0
Physics Enhanced Residual Policy Learning (PERPL) for safety cruising in mixed traffic platooning under actuator and communication delay0
Intelligent Routing Algorithm over SDN: Reusable Reinforcement Learning Approach0
A novel agent with formal goal-reaching guarantees: an experimental study with a mobile robot0
CANDERE-COACH: Reinforcement Learning from Noisy Feedback0
A Distribution-Aware Flow-Matching for Generating Unstructured Data for Few-Shot Reinforcement Learning0
OMG-RL:Offline Model-based Guided Reward Learning for Heparin Treatment0
Scalable Multi-agent Reinforcement Learning for Factory-wide Dynamic Scheduling0
Show:102550
← PrevPage 157 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified