SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 90769100 of 15113 papers

TitleStatusHype
Reinforcement Learning for Adaptive Caching with Dynamic Storage Pricing0
Reinforcement Learning for Adaptive Mesh Refinement0
Reinforcement Learning for Adaptive Video Compressive Sensing0
Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application0
Reinforcement learning for Admission Control in 5G Wireless Networks0
Reinforcement Learning for Admission Control in Wireless Virtual Network Embedding0
Reinforcement Learning for Agile Active Target Sensing with a UAV0
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting0
Reinforcement Learning for AMR Charging Decisions: The Impact of Reward and Action Space Design0
Reinforcement learning for anisotropic p-adaptation and error estimation in high-order solvers0
Reinforcement Learning for Assignment problem0
Reinforcement Learning for Assignment Problem with Time Constraints0
Reinforcement Learning for Autonomous Defence in Software-Defined Networking0
Reinforcement Learning for Autonomous Driving with Latent State Inference and Spatial-Temporal Relationships0
Reinforcement learning for bandwidth estimation and congestion control in real-time communications0
Reinforcement Learning for Battery Energy Storage Dispatch augmented with Model-based Optimizer0
Reinforcement Learning for Beam Pattern Design in Millimeter Wave and Massive MIMO Systems0
Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation0
Reinforcement Learning for Block Decomposition of CAD Models0
Reinforcement Learning for Caching with Space-Time Popularity Dynamics0
Reinforcement Learning for Classical Planning: Viewing Heuristics as Dense Reward Generators0
Reinforcement Learning for Cognitive Delay/Disruption Tolerant Network Node Management in an LEO-based Satellite Constellation0
Reinforcement Learning for Combinatorial Optimization: A Survey0
Reinforcement Learning for ConnectX0
Reinforcement Learning for Control with Probabilistic Stability Guarantee0
Show:102550
← PrevPage 364 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified