SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1122611250 of 15113 papers

TitleStatusHype
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning0
Guaranteed satisficing and finite regret: Analysis of a cognitive satisficing value function0
Guaranteed Trust Region Optimization via Two-Phase KL Penalization0
Guaranteeing Out-Of-Distribution Detection in Deep RL via Transition Estimation0
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation0
Guarded Policy Optimization with Imperfect Online Demonstrations0
"Guess what I'm doing": Extending legibility to sequential decision tasks0
Guided by Guardrails: Control Barrier Functions as Safety Instructors for Robotic Learning0
Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
Guiding Robot Exploration in Reinforcement Learning via Automated Planning0
Guided Exploration in Deep Reinforcement Learning0
Guided Meta-Policy Search0
Guided Online Distillation: Promoting Safe Reinforcement Learning by Offline Demonstration0
Guided Policy Search Based Control of a High Dimensional Advanced Manufacturing Process0
Guided Reinforcement Learning for Robust Multi-Contact Loco-Manipulation0
Guided Safe Shooting: model based reinforcement learning with safety constraints0
LEAGUE: Guided Skill Learning and Abstraction for Long-Horizon Manipulation0
Guiding Global Placement With Reinforcement Learning0
Guiding Reinforcement Learning Exploration Using Natural Language0
Guiding Representation Learning in Deep Generative Models with Policy Gradients0
Guiding Safe Exploration with Weakest Preconditions0
Gym-ANM: Open-source software to leverage reinforcement learning for power system management in research and education0
gym-DSSAT: a crop model turned into a Reinforcement Learning environment0
Gym-preCICE: Reinforcement Learning Environments for Active Flow Control0
Show:102550
← PrevPage 450 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified