SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 88018825 of 15113 papers

TitleStatusHype
Passing Through Narrow Gaps with Deep Reinforcement Learning0
Pass@K Policy Optimization: Solving Harder Reinforcement Learning Problems0
Path Design and Resource Management for NOMA enhanced Indoor Intelligent Robots0
Pathfinding in Random Partially Observable Environments with Vision-Informed Deep Reinforcement Learning0
Path Following and Stabilisation of a Bicycle Model using a Reinforcement Learning Approach0
Path Integral Networks: End-to-End Differentiable Optimal Control0
Machine learning strategies for path-planning microswimmers in turbulent flows0
Path Planning of Cleaning Robot with Reinforcement Learning0
Path Planning using Reinforcement Learning: A Policy Iteration Approach0
Patient level simulation and reinforcement learning to discover novel strategies for treating ovarian cancer0
Patterns, predictions, and actions: A story about machine learning0
Pattern Transfer Learning for Reinforcement Learning in Order Dispatching0
Pauli Network Circuit Synthesis with Reinforcement Learning0
Paused Agent Replay Refresh0
Pavlovian Signalling with General Value Functions in Agent-Agent Temporal Decision Making0
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning0
PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching0
PEARL: Parallelized Expert-Assisted Reinforcement Learning for Scene Rearrangement Planning0
PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning0
Pedestrian Prediction by Planning using Deep Neural Networks0
Penalized Proximal Policy Optimization for Safe Reinforcement Learning0
PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making0
Perception and Navigation in Autonomous Systems in the Era of Learning: A Survey0
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning0
Perceptual Reward Functions0
Show:102550
← PrevPage 353 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified