SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 22512260 of 15113 papers

TitleStatusHype
Execution-based Code Generation using Deep Reinforcement LearningCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Encoding formulas as deep networks: Reinforcement learning for zero-shot execution of LTL formulasCode1
Bridging Imagination and Reality for Model-Based Deep Reinforcement LearningCode1
Object Detection with Deep Reinforcement LearningCode1
Energy Pricing in P2P Energy Systems Using Reinforcement LearningCode1
An Inductive Bias for Distances: Neural Nets that Respect the Triangle InequalityCode1
Enforcing Policy Feasibility Constraints through Differentiable Projection for Energy OptimizationCode1
Bridging RL Theory and Practice with the Effective HorizonCode1
Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking OraclesCode1
Show:102550
← PrevPage 226 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified