SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 23712380 of 15113 papers

TitleStatusHype
Batch Reinforcement Learning Through Continuation Method0
Batch Reinforcement Learning on the Industrial Benchmark: First Experiences0
An Adaptable Approach to Learn Realistic Legged Locomotion without Examples0
Adaptive Stochastic Nonlinear Model Predictive Control with Look-ahead Deep Reinforcement Learning for Autonomous Vehicle Motion Control0
Batch Reinforcement Learning from Crowds0
Batch Recurrent Q-Learning for Backchannel Generation Towards Engaging Agents0
An A* Curriculum Approach to Reinforcement Learning for RGBD Indoor Robot Navigation0
Batch Policy Gradient Methods for Improving Neural Conversation Models0
Learning "What-if" Explanations for Sequential Decision-Making0
An Actor-Critic Method for Simulation-Based Optimization0
Show:102550
← PrevPage 238 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified