SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 78017825 of 15113 papers

TitleStatusHype
MONEYBaRL: Exploiting pitcher decision-making using Reinforcement Learning0
Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials0
Monte-Carlo Planning and Learning with Language Action Value Estimates0
Monte Carlo Planning with Large Language Model for Text-Based Game Agents0
Monte-Carlo Siamese Policy on Actor for Satellite Image Super Resolution0
Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning0
Monte-Carlo Tree Search for Policy Optimization0
Moody Learners -- Explaining Competitive Behaviour of Reinforcement Learning Agents0
MOORe: Model-based Offline-to-Online Reinforcement Learning0
Moral reinforcement learning using actual causation0
More Efficient Exploration with Symbolic Priors on Action Sequence Equivalences0
More Efficient Off-Policy Evaluation through Regularized Targeted Learning0
(More) Efficient Reinforcement Learning via Posterior Sampling0
MOReL: Model-Based Offline Reinforcement Learning0
More Robust Doubly Robust Off-policy Evaluation0
MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models0
MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading0
MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding0
Motion Perception in Reinforcement Learning with Dynamic Objects0
Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments0
Motion Planning by Reinforcement Learning for an Unmanned Aerial Vehicle in Virtual Open Space with Static Obstacles0
Motion Planning for Autonomous Vehicles in the Presence of Uncertainty Using Reinforcement Learning0
Motion Prediction on Self-driving Cars: A Review0
MotionRL: Align Text-to-Motion Generation to Human Preferences with Multi-Reward Reinforcement Learning0
Motivating Physical Activity via Competitive Human-Robot Interaction0
Show:102550
← PrevPage 313 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified