SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 60266050 of 15113 papers

TitleStatusHype
Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity0
Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters0
skrl: Modular and Flexible Library for Reinforcement Learning0
SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL0
SlateFree: a Model-Free Decomposition for Reinforcement Learning with Slate Actions0
SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents0
SLiM-Gym: Reinforcement Learning for Population Genetics0
Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs0
Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling0
Smart caching in a Data Lake for High Energy Physics analysis0
SmartChoices: Hybridizing Programming and Machine Learning0
Smart Exploration in Reinforcement Learning using Bounded Uncertainty Models0
Smart Feasibility Pump: Reinforcement Learning for (Mixed) Integer Programming0
Smart Interference Management xApp using Deep Reinforcement Learning0
SmartPathfinder: Pushing the Limits of Heuristic Solutions for Vehicle Routing Problem with Drones Using Reinforcement Learning0
Smart Scheduling based on Deep Reinforcement Learning for Cellular Networks0
SMART: Self-supervised Multi-task pretrAining with contRol Transformers0
Smart Train Operation Algorithms based on Expert Knowledge and Reinforcement Learning0
Smell of Source: Learning-Based Odor Source Localization with Molecular Communication0
Smoothed Action Value Functions for Learning Gaussian Policies0
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint0
Smoothed Q-learning0
Smooth Imitation Learning via Smooth Costs and Smooth Policies0
Smoothing Deep Reinforcement Learning for Power Control for Spectrum Sharing in Cognitive Radios0
Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning0
Show:102550
← PrevPage 242 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified