SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback on its actions, given as rewards or penalties. The goal is to find the optimal policy, that is, the decision-making strategy that maximizes long-term reward.
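As a rough illustration of the agent-environment loop described above, here is a minimal sketch using tabular Q-learning. Q-learning is only one of many RL algorithms and is chosen here for brevity. The five-state chain environment, the function names (`step`, `choose_action`, `train`, `greedy_policy`) and all hyperparameter values are assumptions made for this example and do not come from any of the listed papers.

```python
import random

# Hypothetical toy environment: a chain of states 0..4, where 4 is terminal.
N_STATES = 5
ACTIONS = (0, 1)   # 0 = move left, 1 = move right
GAMMA = 0.9        # discount factor (assumed value)
ALPHA = 0.5        # learning rate (assumed value)
EPSILON = 0.3      # exploration probability (assumed value)


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal state
    return next_state, reward, done


def choose_action(q_row, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    best = max(q_row)
    return rng.choice([a for a in ACTIONS if q_row[a] == best])


def train(episodes=500, max_steps=100, seed=0):
    """Run Q-learning and return the learned table of action values."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            action = choose_action(q[state], rng)
            next_state, reward, done = step(state, action)
            # Bootstrap from the next state unless the episode has ended.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Pick the highest-valued action in every non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
print(greedy_policy(q_table))
```

The reward arrives only at the goal state, so the agent has to propagate that signal backwards through the discounted bootstrap target. This is the sense in which the learned policy maximizes long-term rather than immediate reward.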

Papers

Showing 4801-4825 of 15113 papers

Title | Status | Hype
Reinforcement Learning-Enabled Decision-Making Strategies for a Vehicle-Cyber-Physical-System in Connected Environment | | 0
Reinforcement Learning-Enabled Reliable Wireless Sensor Networks in Dynamic Underground Environments | | 0
Reinforcement Learning-enabled Satellite Constellation Reconfiguration and Retasking for Mission-Critical Applications | | 0
Reinforcement Learning Enhanced Explainer for Graph Neural Networks | | 0
Reinforcement learning-enhanced genetic algorithm for wind farm layout optimization | | 0
Reinforcement Learning-Enhanced Procedural Generation for Dynamic Narrative-Driven AR Experiences | | 0
Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat | | 0
Reinforcement Learning Experience Reuse with Policy Residual Representation | | 0
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models | | 0
Reinforcement Learning for Active Matter | | 0
Reinforcement Learning for Adaptive Caching with Dynamic Storage Pricing | | 0
Reinforcement Learning for Adaptive Mesh Refinement | | 0
Reinforcement Learning for Adaptive Video Compressive Sensing | | 0
Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application | | 0
Reinforcement learning for Admission Control in 5G Wireless Networks | | 0
Reinforcement Learning for Admission Control in Wireless Virtual Network Embedding | | 0
Reinforcement Learning for Agile Active Target Sensing with a UAV | | 0
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting | | 0
Reinforcement Learning for AMR Charging Decisions: The Impact of Reward and Action Space Design | | 0
Reinforcement learning for anisotropic p-adaptation and error estimation in high-order solvers | | 0
Reinforcement Learning for Assignment problem | | 0
Reinforcement Learning for Assignment Problem with Time Constraints | | 0
Reinforcement Learning for Autonomous Defence in Software-Defined Networking | | 0
Reinforcement Learning for Autonomous Driving with Latent State Inference and Spatial-Temporal Relationships | | 0
Reinforcement learning for bandwidth estimation and congestion control in real-time communications | | 0
Page 193 of 605

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified