SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 40614070 of 15113 papers

TitleStatusHype
Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts0
Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning0
ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies0
Acting upon Imagination: when to trust imagined trajectories in model based reinforcement learning0
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills0
Action abstractions for amortized sampling0
Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning0
Action Categorization for Computationally Improved Task Learning and Planning0
Action-Constrained Reinforcement Learning for Frame-Level Bit Allocation in HEVC/H.265 through Frank-Wolfe Policy Optimization0
Action-dependent Control Variates for Policy Optimization via Stein Identity0
Show:102550
← PrevPage 407 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified