SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 66516700 of 15113 papers

TitleStatusHype
DQN-based Beamforming for Uplink mmWave Cellular-Connected UAVs0
DQN with model-based exploration: efficient learning on environments with sparse rewards0
DR2L: Surfacing Corner Cases to Robustify Autonomous Driving via Domain Randomization Reinforcement Learning0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization0
DRAS-CQSim: A Reinforcement Learning based Framework for HPC Cluster Scheduling0
Drawing Inductor Layout with a Reinforcement Learning Agent: Method and Application for VCO Inductors0
DRDT3: Diffusion-Refined Decision Test-Time Training Model0
DREAM: Adaptive Reinforcement Learning based on Attention Mechanism for Temporal Knowledge Graph Reasoning0
DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics0
DreamerV3 for Traffic Signal Control: Hyperparameter Tuning and Performance0
Dreaming machine learning: Lipschitz extensions for reinforcement learning on financial markets0
Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction0
DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction0
DRIFT: Deep Reinforcement Learning for Functional Software Testing0
DRILL-- Deep Reinforcement Learning for Refinement Operators in ALC0
DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving0
Driver Assistance Eco-driving and Transmission Control with Deep Reinforcement Learning0
DriverGym: Democratising Reinforcement Learning for Autonomous Driving0
Driver Modeling through Deep Reinforcement Learning and Behavioral Game Theory0
Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning0
Driving in Real Life with Inverse Reinforcement Learning0
Driving-Policy Adaptive Safeguard for Autonomous Vehicles Using Reinforcement Learning0
Driving Tasks Transfer in Deep Reinforcement Learning for Decision-making of Autonomous Vehicles0
Driving with Style: Inverse Reinforcement Learning in General-Purpose Planning for Automated Driving0
DRL-Based QoS-Aware Resource Allocation Scheme for Coexistence of Licensed and Unlicensed Users in LTE and Beyond0
DRL-based Slice Placement Under Non-Stationary Conditions0
DRL-based Slice Placement under Realistic Network Load Conditions0
DRL-Clusters: Buffer Management with Clustering based Deep Reinforcement Learning0
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation0
DRL: Deep Reinforcement Learning for Intelligent Robot Control -- Concept, Literature, and Future0
DRL-FAS: A Novel Framework Based on Deep Reinforcement Learning for Face Anti-Spoofing0
DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning0
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation0
DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning0
DSADF: Thinking Fast and Slow for Decision Making0
DSDF: An approach to handle stochastic agents in collaborative multi-agent reinforcement learning0
DSDF: Coordinated look-ahead strategy in stochastic multi-agent reinforcement learning0
D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning0
DSP: A Differential Spatial Prediction Scheme for Comprehensive real industrial datasets0
Dual Active Learning for Reinforcement Learning from Human Feedback0
Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking0
Dual Behavior Regularized Reinforcement Learning0
Dual Control for Approximate Bayesian Reinforcement Learning0
Dual Ensemble Kalman Filter for Stochastic Optimal Control0
Dual Generator Offline Reinforcement Learning0
Dual-Objective Reinforcement Learning with Novel Hamilton-Jacobi-Bellman Formulations0
Dueling Deep Q Network for Highway Decision Making in Autonomous Vehicles: A Case Study0
Dueling RL: Reinforcement Learning with Trajectory Preferences0
DyFEn: Agent-Based Fee Setting in Payment Channel Networks0
Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery0
Show:102550
← PrevPage 134 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified