SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1107611100 of 15113 papers

TitleStatusHype
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition0
Learning as Reinforcement: Applying Principles of Neuroscience for More General Reinforcement Learning Agents0
Data-Driven Learning and Load Ensemble Control0
Attention Routing: track-assignment detailed routing using attention-based reinforcement learning0
Tightening Exploration in Upper Confidence Reinforcement Learning0
Self-Guided Evolution Strategies with Historical Estimated GradientsCode0
Superkernel Neural Architecture Search for Image Denoising0
Variational Policy Propagation for Multi-agent Reinforcement Learning0
Macro-Action-Based Deep Multi-Agent Reinforcement Learning0
Time Adaptive Reinforcement Learning0
Modeling Survival in model-based Reinforcement Learning0
Show Us the Way: Learning to Manage Dialog from Demonstrations0
F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning0
Knowledge-guided Deep Reinforcement Learning for Interactive Recommendation0
Deep Reinforcement Learning for Adaptive Learning Systems0
Approximate Inverse Reinforcement Learning from Vision-based Imitation Learning0
Goal-conditioned Batch Reinforcement Learning for Rotation Invariant Locomotion0
Data-Driven Robust Control Using Reinforcement Learning0
A Game Theoretic Framework for Model Based Reinforcement Learning0
Analyzing Reinforcement Learning Benchmarks with Random Weight GuessingCode0
OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence GenerationCode0
Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions0
Reinforcement Learning in a Physics-Inspired Semi-Markov EnvironmentCode0
Safe deep reinforcement learning-based constrained optimal control scheme for active distribution networks0
Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning0
Show:102550
← PrevPage 444 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified