SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1012610150 of 15113 papers

TitleStatusHype
Double Meta-Learning for Data Efficient Policy Optimization in Non-Stationary Environments0
Double Q(σ) and Q(σ, λ): Unifying Reinforcement Learning Control Algorithms0
Double Q-learning0
Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation0
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning0
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning0
DPO: A Differential and Pointwise Control Approach to Reinforcement Learning0
DQLAP: Deep Q-Learning Recommender Algorithm with Update Policy for a Real Steam Turbine System0
DQNAS: Neural Architecture Search using Reinforcement Learning0
DQN-based Beamforming for Uplink mmWave Cellular-Connected UAVs0
DQN with model-based exploration: efficient learning on environments with sparse rewards0
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain Randomization Reinforcement Learning0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization0
DRAS-CQSim: A Reinforcement Learning based Framework for HPC Cluster Scheduling0
Drawing Inductor Layout with a Reinforcement Learning Agent: Method and Application for VCO Inductors0
DRDT3: Diffusion-Refined Decision Test-Time Training Model0
DREAM: Adaptive Reinforcement Learning based on Attention Mechanism for Temporal Knowledge Graph Reasoning0
DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics0
DreamerV3 for Traffic Signal Control: Hyperparameter Tuning and Performance0
Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets0
Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction0
DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction0
DRIFT: Deep Reinforcement Learning for Functional Software Testing0
DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC0
DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving0
Show:102550
← PrevPage 406 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified