SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 28212830 of 15113 papers

TitleStatusHype
A bandit approach to curriculum generation for automatic speech recognition0
Data-Driven Inverse Reinforcement Learning for Expert-Learner Zero-Sum Games0
Adaptive Dialog Policy Learning with Hindsight and User Modeling0
A Unifying View of Optimism in Episodic Reinforcement Learning0
ACES -- Automatic Configuration of Energy Harvesting Sensors with Reinforcement Learning0
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning0
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme0
A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning0
Data-Driven H-infinity Control with a Real-Time and Efficient Reinforcement Learning Algorithm: An Application to Autonomous Mobility-on-Demand Systems0
A unified view of entropy-regularized Markov decision processes0
Show:102550
← PrevPage 283 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified