SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 49514975 of 15113 papers

TitleStatusHype
Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications0
Reinforcement Learning for Variable Selection in a Branch and Bound Algorithm0
Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control0
Reinforcement Learning for Visual Object Detection0
Reinforcement Learning for Volt-Var Control: A Novel Two-stage Progressive Training Strategy0
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos0
Reinforcement Learning Framework for Opportunistic Routing in WSNs0
Reinforcement Learning Framework for Quantitative Trading0
Reinforcement Learning Framework for Server Placement and Workload Allocation in Multi-Access Edge Computing0
Reinforcement learning framework for the mechanical design of microelectronic components under multiphysics constraints0
Reinforcement Learning from Bagged Reward0
Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel0
Reinforcement Learning from Diverse Human Preferences0
Reinforcement Learning from Imperfect Demonstrations0
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards0
Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance0
Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization0
Reinforcement Learning-Guided Semi-Supervised Learning0
Reinforcement Learning in 20Q Game with Generic Knowledge Bases0
Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space0
Reinforcement Learning in Agent-Based Market Simulation: Unveiling Realistic Stylized Facts and Behavior0
Reinforcement Learning in a large scale photonic Recurrent Neural Network0
Reinforcement Learning in a Neurally Controlled Robot Using Dopamine Modulated STDP0
Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization0
Reinforcement Learning in Categorical Cybernetics0
Show:102550
← PrevPage 199 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified