SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 94519475 of 15113 papers

TitleStatusHype
Recent Advances of Deep Robotic Affordance Learning: A Reinforcement Learning Perspective0
Recent Progress in Energy Management of Connected Hybrid Electric Vehicles Using Reinforcement Learning0
Reinforcement Learning as a Robotics-Inspired Framework for Insect Navigation: From Spatial Representations to Neural Implementation0
Recognition Method of Important Words in Korean Text based on Reinforcement Learning0
Recommendation Fairness: From Static to Dynamic0
Recommendations with Negative Feedback via Pairwise Deep Reinforcement Learning0
Recommendation System-based Upper Confidence Bound for Online Advertising0
Recommending the optimal policy by learning to act from temporal data0
Re-conceptualising the Language Game Paradigm in the Framework of Multi-Agent Reinforcement Learning0
RECONNAISSANCE FOR REINFORCEMENT LEARNING WITH SAFETY CONSTRAINTS0
Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning0
ReCoRe: Regularized Contrastive Representation Learning of World Model0
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning0
Rectifying Reinforcement Learning for Reward Matching0
Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition0
Recurrent Attention Models for Depth-Based Person Identification0
Recurrent Control Nets for Deep Reinforcement Learning0
Recurrent Reinforcement Learning: A Hybrid Approach0
Recurrent Value Functions0
Recurrent World Models Facilitate Policy Evolution0
Recursive Constraints to Prevent Instability in Constrained Reinforcement Learning0
Recursive Least Squares Advantage Actor-Critic Algorithms0
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning0
Recursive Reinforcement Learning0
Recursive Sparse Pseudo-input Gaussian Process SARSA0
Show:102550
← PrevPage 379 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified