SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 60516075 of 15113 papers

TitleStatusHype
Snap Angle Prediction for 360° Panoramas0
Snap Angle Prediction for 360^ Panoramas0
SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning0
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents0
Social diversity and social preferences in mixed-motive reinforcement learning0
Social Interpretable Reinforcement Learning0
Socially Fair Reinforcement Learning0
Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS0
Social Vehicle Swarms: A Novel Perspective on Social-aware Vehicular Communication Architecture0
Socratic RL: A Novel Framework for Efficient Knowledge Acquisition through Iterative Reflection and Viewpoint Distillation0
Soft Action Priors: Towards Robust Policy Transfer0
Soft Actor-Critic With Integer Actions0
SoftCTRL: Soft conservative KL-control of Transformer Reinforcement Learning for Autonomous Driving0
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL0
Soft Expert Reward Learning for Vision-and-Language Navigation0
Regularized Softmax Deep Multi-Agent Q-Learning0
Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning0
Soft policy optimization using dual-track advantage estimator0
Soft Q-Learning with Mutual-Information Regularization0
Soft-Robust Actor-Critic Policy-Gradient0
Soft-Robust Algorithms for Batch Reinforcement Learning0
SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning0
Solar Power driven EV Charging Optimization with Deep Reinforcement Learning0
SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixels0
Solipsistic Reinforcement Learning0
Show:102550
← PrevPage 243 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified