SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback on its actions, given as rewards or penalties. The goal is to find the optimal policy, that is, the decision-making strategy that maximizes expected long-term reward.
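The agent-environment loop described above can be illustrated with a minimal sketch. It assumes a hypothetical five-state chain environment and uses tabular Q-learning, chosen here only as one familiar RL algorithm; the environment, the function names (`step`, `train`, `greedy_policy`), and the hyperparameters are illustrative assumptions and do not come from any paper listed on this page.

```python
import random

# Hypothetical toy setup (an assumption for illustration only):
# states 0..4 on a line, state 4 is terminal and yields the only reward.
N_STATES = 5
ACTIONS = (0, 1)  # 0 = move left, 1 = move right
GAMMA = 0.9       # discount factor for future rewards


def step(state, action):
    """Environment dynamics: return (next_state, reward, done)."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                action = rng.choice(ACTIONS)
            else:
                # Exploit: pick a best-valued action, breaking ties randomly.
                best = max(q[state])
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Bootstrapped target: reward plus discounted best next value.
            target = reward + (0.0 if done else GAMMA * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action in each non-terminal state under the learned values."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

In this toy chain the reward is only reachable by moving right, so the learned greedy policy should prefer action `1` in every non-terminal state. Environments and algorithms in the listed papers are far richer, but the observe, act, and update cycle has the same shape.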

Papers

Showing 7951-7975 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| Open Problem: Approximate Planning of POMDPs in the class of Memoryless Policies | | 0 |
| Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning | | 0 |
| Open Problems and Modern Solutions for Deep Reinforcement Learning | | 0 |
| Open Problem: Tight Online Confidence Intervals for RKHS Elements | | 0 |
| Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning | | 0 |
| Operator Shifting for Model-based Policy Evaluation | | 0 |
| Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning | | 0 |
| Operator Splitting Value Iteration | | 0 |
| Opinion shaping in social networks using reinforcement learning | | 0 |
| Opportunistic Episodic Reinforcement Learning | | 0 |
| Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Control | | 0 |
| Opportunities of Reinforcement Learning in South Africa's Just Transition | | 0 |
| Low-Rank MDPs with Continuous Action Spaces | | 0 |
| Optimal and Learning Control for Autonomous Robots | | 0 |
| Optimal Attacks on Reinforcement Learning Policies | | 0 |
| Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach | | 0 |
| Optimal Bidding Strategy without Exploration in Real-time Bidding | | 0 |
| Optimal Charging Method for Effective Li-ion Battery Life Extension Based on Reinforcement Learning | | 0 |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | | 0 |
| Optimal control barrier functions for RL based safe powertrain control | | 0 |
| Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC | | 0 |
| Optimal control of eye-movements during visual search | | 0 |
| Optimal Control of Material Micro-Structures | | 0 |
| Optimal control of point-to-point navigation in turbulent time-dependent flows using Reinforcement Learning | | 0 |
| Optimal coordination of resources: A solution from reinforcement learning | | 0 |
Page 319 of 605

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |