SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 49264950 of 15113 papers

TitleStatusHype
Reinforcement Learning for Semantic Segmentation in Indoor Scenes0
Funnel-based Reward Shaping for Signal Temporal Logic Tasks in Reinforcement Learning0
Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology0
Reinforcement Learning for Sociohydrology0
Reinforcement Learning for Solving the Pricing Problem in Column Generation: Applications to Vehicle Routing0
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment0
Reinforcement Learning for Standards Design0
Reinforcement Learning for Stock Transactions0
Reinforcement Learning for Strategic Recommendations0
Reinforcement learning for suppression of collective activity in oscillatory ensembles0
Reinforcement Learning For Survival, A Clinically Motivated Method For Critically Ill Patients0
Reinforcement Learning for Systematic FX Trading0
Reinforcement Learning for Task Specifications with Action-Constraints0
Reinforcement Learning for Test Case Prioritization0
Reinforcement Learning for the Beginning of Starcraft II Game0
Reinforcement learning for the privacy preservation and manipulation of eye tracking data0
Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python0
Reinforcement Learning for the Soccer Dribbling Task0
Reinforcement Learning for the Unit Commitment Problem0
Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems0
Reinforcement learning for traffic signal control in hybrid action space0
Reinforcement Learning for Transition-Based Mention Detection0
Reinforcement Learning for UA V Attitude Control0
Reinforcement Learning for UAV Autonomous Navigation, Mapping and Target Detection0
Reinforcement Learning for UAV control with Policy and Reward Shaping0
Show:102550
← PrevPage 198 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified