SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 37713780 of 15113 papers

TitleStatusHype
Deep Reinforcement Learning for mmWave Initial Beam Alignment0
Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories0
Data Driven Reward Initialization for Preference based Reinforcement Learning0
A State Augmentation based approach to Reinforcement Learning from Human Preferences0
Robot path planning using deep reinforcement learning0
Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning0
Post Reinforcement Learning InferenceCode0
Quantum Computing Provides Exponential Regret Improvement in Episodic Reinforcement Learning0
Dual RL: Unification and New Methods for Reinforcement and Imitation LearningCode1
Tuning computer vision models with task rewards0
Show:102550
← PrevPage 378 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified