SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 37513760 of 15113 papers

TitleStatusHype
DC4L: Distribution Shift Recovery via Data-Driven Control for Deep Learning ModelsCode0
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue SystemsCode0
Backstepping Temporal Difference Learning0
Differentiable Arbitrating in Zero-sum Markov Games0
Safe Deep Reinforcement Learning by Verifying Task-Level Properties0
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical RobotCode2
Auto.gov: Learning-based Governance for Decentralized Finance (DeFi)Code0
Compositionality and Bounds for Optimal Value Functions in Reinforcement Learning0
Generalization in Visual Reinforcement Learning with the Reward Sequence DistributionCode0
AutoDOViz: Human-Centered Automation for Decision Optimization0
Show:102550
← PrevPage 376 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified