SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1245112500 of 15113 papers

TitleStatusHype
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks0
Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning0
Sketch-Based Linear Value Function Approximation0
Sketch-to-Skill: Bootstrapping Robot Learning with Human Drawn Trajectory Sketches0
Skill-based Meta-Reinforcement Learning0
Skill-based Model-based Reinforcement Learning0
Skill-Critic: Refining Learned Skills for Hierarchical Reinforcement Learning0
Skill Discovery in Continuous Reinforcement Learning Domains using Skill Chaining0
Skill Discovery of Coordination in Multi-agent Reinforcement Learning0
Skilled Experience Catalogue: A Skill-Balancing Mechanism for Non-Player Characters using Reinforcement Learning0
Skill-Enhanced Reinforcement Learning Acceleration from Demonstrations0
SkillS: Adaptive Skill Sequencing for Efficient Temporally-Extended Exploration0
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning0
Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity0
Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters0
skrl: Modular and Flexible Library for Reinforcement Learning0
SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL0
SlateFree: a Model-Free Decomposition for Reinforcement Learning with Slate Actions0
SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents0
SLiM-Gym: Reinforcement Learning for Population Genetics0
Slowly Changing Adversarial Bandit Algorithms are Efficient for Discounted MDPs0
Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling0
Smart caching in a Data Lake for High Energy Physics analysis0
SmartChoices: Hybridizing Programming and Machine Learning0
Smart Exploration in Reinforcement Learning using Bounded Uncertainty Models0
Smart Feasibility Pump: Reinforcement Learning for (Mixed) Integer Programming0
Smart Interference Management xApp using Deep Reinforcement Learning0
SmartPathfinder: Pushing the Limits of Heuristic Solutions for Vehicle Routing Problem with Drones Using Reinforcement Learning0
Smart Scheduling based on Deep Reinforcement Learning for Cellular Networks0
SMART: Self-supervised Multi-task pretrAining with contRol Transformers0
Smart Train Operation Algorithms based on Expert Knowledge and Reinforcement Learning0
Smell of Source: Learning-Based Odor Source Localization with Molecular Communication0
Smoothed Action Value Functions for Learning Gaussian Policies0
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint0
Smoothed Q-learning0
Smooth Imitation Learning via Smooth Costs and Smooth Policies0
Smoothing Deep Reinforcement Learning for Power Control for Spectrum Sharing in Cognitive Radios0
Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning0
Snap Angle Prediction for 360° Panoramas0
Snap Angle Prediction for 360^ Panoramas0
SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning0
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents0
Social diversity and social preferences in mixed-motive reinforcement learning0
Social Interpretable Reinforcement Learning0
Socially Fair Reinforcement Learning0
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS0
Social Vehicle Swarms: A Novel Perspective on Social-aware Vehicular Communication Architecture0
Socratic RL: A Novel Framework for Efficient Knowledge Acquisition through Iterative Reflection and Viewpoint Distillation0
Soft Action Priors: Towards Robust Policy Transfer0
Soft Actor-Critic With Integer Actions0
Show:102550
← PrevPage 250 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified