SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1125111275 of 15113 papers

TitleStatusHype
Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization0
Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning0
Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning0
Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path0
Deep Reinforcement Learning for QoS-Constrained Resource Allocation in Multiservice Networks0
Cluster-Based Social Reinforcement Learning0
Adaptive Structural Hyper-Parameter Configuration by Q-Learning0
Formal Controller Synthesis for Continuous-Space MDPs via Model-Free Reinforcement Learning0
Gaussian Process Policy Optimization0
Learning Force Control for Contact-rich Manipulation Tasks with Rigid Position-controlled Robots0
Scaling Up Multiagent Reinforcement Learning for Robotic Systems: Learn an Adaptive Sparse Communication Graph0
Dynamic Queue-Jump Lane for Emergency Vehicles under Partially Connected Settings: A Multi-Agent Deep Reinforcement Learning Approach0
Real-World Human-Robot Collaborative Reinforcement Learning0
Risk-Averse Learning by Temporal Difference Methods0
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss0
A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement LearningCode0
Fully Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks0
Learning Near Optimal Policies with Low Inherent Bellman Error0
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts0
TAdam: A Robust Stochastic Gradient OptimizerCode0
A Self-Tuning Actor-Critic Algorithm0
Deep Reinforcement Learning for FlipIt Security Game0
Mixed Reinforcement Learning with Additive Stochastic Uncertainty0
Reinforcement Learning through Active Inference0
On Catastrophic Interference in Atari 2600 GamesCode0
Show:102550
← PrevPage 451 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified