SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 45214530 of 15113 papers

TitleStatusHype
A Local Temporal Difference Code for Distributional Reinforcement Learning0
A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning0
AlphaD3M: Machine Learning Pipeline Synthesis0
Alpha-DAG: a reinforcement learning based algorithm to learn Directed Acyclic Graphs0
Alpha-divergence bridges maximum likelihood and reinforcement learning in neural sequence generation0
AlphaRouter: Quantum Circuit Routing with Reinforcement Learning and Tree Search0
AlphaSeq: Sequence Discovery with Deep Reinforcement Learning0
AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process0
AlphaStar: An Evolutionary Computation Perspective0
AlphaStock: A Buying-Winners-and-Selling-Losers Investment Strategy using Interpretable Deep Reinforcement Attention Networks0
Show:102550
← PrevPage 453 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified