SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 98769900 of 15113 papers

TitleStatusHype
GRAC: Self-Guided and Self-Regularized Actor-CriticCode0
HTMRL: Biologically Plausible Reinforcement Learning with Hierarchical Temporal MemoryCode0
A Contraction Approach to Model-based Reinforcement Learning0
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulationsCode1
GeneraLight: Improving Environment Generalization of Traffic Signal Control via Meta Reinforcement Learning0
Finding Effective Security Strategies through Reinforcement Learning and Self-PlayCode1
Knowledge-Assisted Deep Reinforcement Learning in 5G Scheduler Design: From Theoretical Framework to Implementation0
SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement LearningCode1
Reward Maximisation through Discrete Active Inference0
Reconstructing Actions To Explain Deep Reinforcement Learning0
Time your hedge with Deep Reinforcement Learning0
Theory of Mind with Guilt Aversion Facilitates Cooperative Reinforcement Learning0
Transfer Learning in Deep Reinforcement Learning: A Survey0
Meta-AAD: Active Anomaly Detection with Deep Reinforcement LearningCode1
Text Generation by Learning from DemonstrationsCode1
DRL-FAS: A Novel Framework Based on Deep Reinforcement Learning for Face Anti-Spoofing0
Reinforcement Learning for Strategic Recommendations0
Soft policy optimization using dual-track advantage estimator0
Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly DataCode1
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents0
Decoding Polar Codes with Reinforcement Learning0
Decoupling Representation Learning from Reinforcement LearningCode2
Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile NetworksCode1
Efficient Transformers: A Survey0
Reinforcement Learning for Dynamic Resource Optimization in 5G Radio Access Network Slicing0
Show:102550
← PrevPage 396 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified