SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 25612570 of 15113 papers

TitleStatusHype
Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications0
Convergence Guarantees for Deep Epsilon Greedy Policy Learning0
A Meta-Reinforcement Learning Approach to Process Control0
Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback0
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards0
Adaptive patch foraging in deep reinforcement learning agents0
Convergence for Natural Policy Gradient on Infinite-State Queueing MDPs0
Convergence of Batch Asynchronous Stochastic Approximation With Applications to Reinforcement Learning0
Autonomous Warehouse Robot using Deep Q-Learning0
A Comparative Analysis of Deep Reinforcement Learning-enabled Freeway Decision-making for Automated Vehicles0
Show:102550
← PrevPage 257 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified