SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
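To make the agent, environment, and reward loop described above concrete, here is a minimal sketch using tabular Q-learning, one of many RL methods and not one taken from the papers listed on this page. The five-cell corridor task, the `step` function, and all hyperparameter values are hypothetical choices made only for illustration.

```python
import random

N_STATES = 5      # hypothetical corridor: cells 0..4, cell 4 is the goal
ACTIONS = (0, 1)  # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment: reward 1.0 only when the goal cell is reached."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def discounted_return(rewards, gamma):
    """Cumulative reward with discounting: sum of gamma**t * r_t."""
    return sum(gamma ** t * r for t, r in enumerate(rewards))


def choose_action(q_row, epsilon, rng):
    """Epsilon-greedy action choice; ties between actions are broken at random."""
    if rng.random() < epsilon or q_row[0] == q_row[1]:
        return rng.choice(ACTIONS)
    return 0 if q_row[0] > q_row[1] else 1


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values Q[state][action] from interaction with the toy task."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _episode in range(episodes):
        state = 0
        for _t in range(100):  # cap on episode length (assumed value)
            action = choose_action(q[state], epsilon, rng)
            next_state, reward, done = step(state, action)
            # Q-learning target: immediate reward plus discounted best next value.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal cell."""
    return [0 if row[0] > row[1] else 1 for row in q[:-1]]
```

Calling `greedy_policy(train())` returns the learned action for each non-terminal cell. In this toy task the reward-maximizing policy is to step right everywhere, which is the "optimal policy" of the definition above in miniature.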

Papers

Showing 8426-8450 of 15113 papers

Title | Status | Hype
PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration | | 0
Practical and efficient quantum circuit synthesis and transpiling with Reinforcement Learning | | 0
Practical Kernel-Based Reinforcement Learning | | 0
Practical Marginalized Importance Sampling with the Successor Representation | | 0
Practical Risk Measures in Reinforcement Learning | | 0
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models | | 0
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning | | 0
Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis | | 0
PrecoderNet: Hybrid Beamforming for Millimeter Wave Systems with Deep Reinforcement Learning | | 0
Predecessor Features | | 0
Predicted Variables in Programming | | 0
Predicting Goal-directed Attention Control Using Inverse-Reinforcement Learning | | 0
Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation | | 0
Predicting Multiple Actions for Stochastic Continuous Control | | 0
Predicting the Need for Blood Transfusion in Intensive Care Units with Reinforcement Learning | | 0
Prediction Based Decision Making for Autonomous Highway Driving | | 0
Prediction Improves Simultaneous Neural Machine Translation | | 0
PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation | | 0
Predictive auxiliary objectives in deep RL mimic learning in the brain | | 0
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards | | 0
Predictive Control Using Learned State Space Models via Rolling Horizon Evolution | | 0
Predictive Crypto-Asset Automated Market Making Architecture for Decentralized Finance using Deep Reinforcement Learning | | 0
Predictive Maintenance for Edge-Based Sensor Networks: A Deep Reinforcement Learning Approach | | 0
Predictive Maneuver Planning with Deep Reinforcement Learning (PMP-DRL) for comfortable and safe autonomous driving | | 0
Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | | Unverified