SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 85768600 of 15113 papers

TitleStatusHype
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning0
Operator Splitting Value Iteration0
Opinion shaping in social networks using reinforcement learning0
Opportunistic Episodic Reinforcement Learning0
Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Control0
Opportunities of Reinforcement Learning in South Africa's Just Transition0
Optimal Actuator Attacks on Autonomous Vehicles Using Reinforcement Learning0
Optimal and Learning Control for Autonomous Robots0
Optimal Attacks on Reinforcement Learning Policies0
Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach0
Optimal Bidding Strategy without Exploration in Real-time Bidding0
Optimal Charging Method for Effective Li-ion Battery Life Extension Based on Reinforcement Learning0
Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian0
Optimal control barrier functions for RL based safe powertrain control0
Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC0
Optimal control of eye-movements during visual search0
Optimal Control of Material Micro-Structures0
Optimal control of point-to-point navigation in turbulent time-dependent flows using Reinforcement Learning0
Optimal coordination of resources: A solution from reinforcement learning0
Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning0
Optimal Decision-Making in Mixed-Agent Partially Observable Stochastic Environments via Reinforcement Learning0
Optimal Demand Response Using Device Based Reinforcement Learning0
Optimal Dispatch in Emergency Service System via Reinforcement Learning0
Optimal Hierarchical Learning Path Design with Reinforcement Learning0
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs0
Show:102550
← PrevPage 344 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified