SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 43264350 of 15113 papers

TitleStatusHype
Safe Reinforcement Learning via Probabilistic Logic ShieldsCode0
Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement LearningCode0
Safe Reinforcement Learning via ShieldingCode0
Mo' States Mo' Problems: Emergency Stop Mechanisms from ObservationCode0
Online Learning in Iterated Prisoner's Dilemma to Mimic Human BehaviorCode0
Reinforcement Learning Approach for Mapping Applications to Dataflow-Based Coarse-Grained Reconfigurable ArrayCode0
Uncovering Instabilities in Variational-Quantum Deep Q-NetworksCode0
Markov Abstractions for PAC Reinforcement Learning in Non-Markov Decision ProcessesCode0
Understanding Adversarial Attacks on Observations in Deep Reinforcement LearningCode0
Planning Multiple Epidemic Interventions with Reinforcement LearningCode0
Planning the path with Reinforcement Learning: Optimal Robot Motion Planning in RoboCup Small Size League EnvironmentsCode0
Planning to Learn: A Novel Algorithm for Active Learning during Model-Based PlanningCode0
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial OptimizationCode0
Safe Reinforcement Learning with Scene Decomposition for Navigating Complex Urban EnvironmentsCode0
Maximum Reward Formulation In Reinforcement LearningCode0
Understanding Game-Playing Agents with Natural Language AnnotationsCode0
Safer Reinforcement Learning through Transferable Instinct NetworksCode0
Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN TargetCode0
Planning with Goal-Conditioned PoliciesCode0
Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement LearningCode0
Understanding the Evolution of Linear Regions in Deep Reinforcement LearningCode0
Understanding the impact of entropy on policy optimizationCode0
MazeBase: A Sandbox for Learning from GamesCode0
Reinforcement Learning Assisted Recursive QAOACode0
Reinforcement Learning -based Adaptation and Scheduling Methods for Multi-source DASHCode0
Show:102550
← PrevPage 174 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified