SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 97769800 of 15113 papers

TitleStatusHype
Deep Reinforcement Learning with Surrogate Agent-Environment Interface0
Deep Reinforcement Learning with Symmetric Prior for Predictive Power Allocation to Mobile Users0
Deep reinforcement learning with symmetric data augmentation applied for aircraft lateral attitude tracking control0
Deep Reinforcement Learning with Vector Quantized Encoding0
Deep Reinforcement Learning with Weighted Q-Learning0
Deep Residual Reinforcement Learning0
Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT0
Deep RL for Blood Glucose Control: Lessons, Challenges, and Opportunities0
Deep RL with Hierarchical Action Exploration for Dialogue Generation0
Deep RL With Information Constrained Policies: Generalization in Continuous Control0
DeepRNG: Towards Deep Reinforcement Learning-Assisted Generative Testing of Software0
DeepScalper: A Risk-Aware Reinforcement Learning Framework to Capture Fleeting Intraday Trading Opportunities0
Deep Sets for Generalization in RL0
Deep SIMBAD: Active Landmark-based Self-localization Using Ranking -based Scene Descriptor0
DeepSlicing: Deep Reinforcement Learning Assisted Resource Allocation for Network Slicing0
Deep Surrogate Assisted Generation of Environments0
DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning0
Deep Transfer Q-Learning for Offline Non-Stationary Reinforcement Learning0
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO0
Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework0
DeepWiVe: Deep-Learning-Aided Wireless Video Transmission0
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays0
Adversary Agnostic Robust Deep Reinforcement Learning0
Defense Against Reward Poisoning Attacks in Reinforcement Learning0
Defensive Escort Teams via Multi-Agent Deep Reinforcement Learning0
Show:102550
← PrevPage 392 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified