SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.
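As a rough illustration of the loop described above, the following minimal sketch trains a tabular Q-learning agent on a toy five-state chain. The environment, the `step` and `train` helpers, and all hyperparameter values are assumptions made for this example; they are not taken from any paper listed on this page.

```python
import random

N_STATES = 5          # hypothetical chain: states 0..4, state 4 is the terminal goal
ACTIONS = (0, 1)      # 0 = move left, 1 = move right

def step(state, action):
    """Toy environment (assumed for illustration): return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

def greedy_action(q_row, rng):
    """Pick a highest-valued action, breaking ties at random."""
    best = max(q_row)
    return rng.choice([a for a in ACTIONS if q_row[a] == best])

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(200):  # cap on episode length
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = greedy_action(q[state], rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q

q_table = train()
# Greedy policy for the non-terminal states (1 means "move right").
policy = [greedy_action(row, random.Random(0)) for row in q_table[:-1]]
print(policy)
```

The reward arrives only at the goal, so the agent has to propagate that signal backwards through the discounted targets; this is one simple way the "long-term reward" objective shows up in practice.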

Papers

Showing 5026-5050 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| Reinforcement Learning of Structured Control for Linear Systems with Unknown State Matrix | | 0 |
| Reinforcement Learning of Theorem Proving | | 0 |
| Reinforcement Learning of the Prediction Horizon in Model Predictive Control | | 0 |
| Reinforcement Learning of Two-Issue Negotiation Dialogue Policies | | 0 |
| Reinforcement Learning on Computational Resource Allocation of Cloud-based Wireless Networks | | 0 |
| Reinforcement Learning on Encrypted Data | | 0 |
| Graph neural networks-based Scheduler for Production planning problems using Reinforcement Learning | | 0 |
| Reinforcement Learning Policy Recommendation for Interbank Network Stability | | 0 |
| Reinforcement-Learning Portfolio Allocation with Dynamic Embedding of Market Information | | 0 |
| Reinforcement Learning Portfolio Manager Framework with Monte Carlo Simulation | | 0 |
| Reinforcement Learning: Prediction, Control and Value Function Approximation | | 0 |
| Reinforcement Learning Problem Solving with Large Language Models | | 0 |
| Reinforcement Learning reveals fundamental limits on the mixing of active particles | | 0 |
| Reinforcement learning reward function in unmanned aerial vehicle control tasks | | 0 |
| On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing | | 0 |
| Reinforcement Learning (RL) Meets Urban Climate Modeling: Investigating the Efficacy and Impacts of RL-Based HVAC Control | | 0 |
| Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications Outside Coverage | | 0 |
| Reinforcement Learning State Estimation for High-Dimensional Nonlinear Systems | | 0 |
| Reinforcement Learning Teachers of Test Time Scaling | | 0 |
| Reinforcement learning techniques for Outer Loop Link Adaptation in 4G/5G systems | | 0 |
| Reinforcement Learning through Active Inference | | 0 |
| Reinforcement Learning To Adapt Speech Enhancement to Instantaneous Input Signal Quality | | 0 |
| Reinforcement Learning to Optimize Lifetime Value in Cold-Start Recommendation | | 0 |
| Reinforcement Learning to Optimize Long-term User Engagement in Recommender Systems | | 0 |
| Reinforcement Learning to Optimize the Logistics Distribution Routes of Unmanned Aerial Vehicle | | 0 |
Page 202 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |