SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 76767700 of 15113 papers

TitleStatusHype
Model-based Policy Search for Partially Measurable Systems0
Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks0
Model-based Reinforcement Learning and the Eluder Dimension0
Model-based Reinforcement Learning: A Survey0
Model-Based Reinforcement Learning Exploiting State-Action Equivalence0
Model-based reinforcement learning for biological sequence design0
Model-based Reinforcement Learning for Predictions and Control for Limit Order Books0
Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements0
Model-Based Reinforcement Learning for Approximate Optimal Control with Temporal Logic Specifications0
Model-based Reinforcement Learning for Service Mesh Fault Resiliency in a Web Application-level0
Model-Based Reinforcement Learning via Stochastic Hybrid Models0
Model-based reinforcement learning for protein backbone design0
Model-Based Reinforcement Learning for Control of Strongly-Disturbed Unsteady Aerodynamic Flows0
Model Based Reinforcement Learning for Atari0
Model-Based Reinforcement Learning for Sepsis Treatment0
Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control0
Whole-Chain Recommendations0
Model-based Reinforcement Learning from Signal Temporal Logic Specifications0
Model-Based Reinforcement Learning for Offline Zero-Sum Markov Games0
Model-Based Reinforcement Learning via Imagination with Derived Memory0
Model-Based Reinforcement Learning via Meta-Policy Optimization0
Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration0
Model-based Reinforcement Learning with Ensembled Model-value Expansion0
Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation0
Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network0
Show:102550
← PrevPage 308 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified