SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 19511960 of 15113 papers

TitleStatusHype
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RLCode1
ALLSTEPS: Curriculum-driven Learning of Stepping Stone SkillsCode1
Learning hierarchical behavior and motion planning for autonomous drivingCode1
Curious Hierarchical Actor-Critic Reinforcement LearningCode1
Plan2Vec: Unsupervised Representation Learning by Latent PlansCode1
CARL: Controllable Agent with Reinforcement Learning for Quadruped LocomotionCode1
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document SummarizationCode1
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open ProblemsCode1
Off-Policy Adversarial Inverse Reinforcement LearningCode1
Option Discovery using Deep Skill ChainingCode1
Show:102550
← PrevPage 196 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified