SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 82018225 of 15113 papers

TitleStatusHype
Pedestrian Prediction by Planning using Deep Neural Networks0
Penalized Proximal Policy Optimization for Safe Reinforcement Learning0
PEORL: Integrating Symbolic Planning and Hierarchical Reinforcement Learning for Robust Decision-Making0
Perception and Navigation in Autonomous Systems in the Era of Learning: A Survey0
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning0
Perceptual Reward Functions0
Perceptual Values from Observation0
Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms0
Performance-Driven Controller Tuning via Derivative-Free Reinforcement Learning0
Performance Dynamics and Termination Errors in Reinforcement Learning: A Unifying Perspective0
Performance Optimization for Semantic Communications: An Attention-based Reinforcement Learning Approach0
Performance Optimization for Variable Bitwidth Federated Learning in Wireless Networks0
Performance-Weighed Policy Sampling for Meta-Reinforcement Learning0
Performative Reinforcement Learning0
PERIL: Probabilistic Embeddings for hybrid Meta-Reinforcement and Imitation Learning0
Periodic agent-state based Q-learning for POMDPs0
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning0
PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators0
Autonomous Reinforcement Learning via Subgoal Curricula0
Persistent Rule-based Interactive Reinforcement Learning0
Towards Personalization of User Preferences in Partially Observable Smart Home Environments0
Personalisation via Dynamic Policy Fusion0
Personalization for Web-based Services using Offline Reinforcement Learning0
A clustering-based reinforcement learning approach for tailored personalization of e-Health interventions0
Personalization of Hearing Aid Compression by Human-In-Loop Deep Reinforcement Learning0
Show:102550
← PrevPage 329 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified