SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the expected long-term reward.
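The interaction loop described above can be sketched in a few lines of Python. This is a minimal illustration, not a method taken from any paper listed on this page: it assumes a hypothetical five-cell corridor environment and uses tabular Q-learning, chosen here only because it is compact. The names `step`, `choose_action`, `train`, and `greedy_policy`, and the values of `GAMMA`, `alpha`, and `epsilon`, are all assumptions made for the example.

```python
import random

N_STATES = 5            # corridor cells 0..4; cell 4 is the goal (hypothetical toy task)
ACTIONS = (-1, +1)      # index 0 = move left, index 1 = move right
GAMMA = 0.9             # discount factor applied to future reward (assumed value)


def step(state, action_index):
    """Environment: apply the move and return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q_row, epsilon, rng):
    """Epsilon-greedy choice; ties between equal values are broken at random."""
    if rng.random() < epsilon:
        return rng.randrange(len(ACTIONS))
    best = max(q_row)
    return rng.choice([i for i, v in enumerate(q_row) if v == best])


def train(episodes=500, alpha=0.5, epsilon=0.2, seed=0):
    """Agent: learn action values from reward feedback with tabular Q-learning."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q[state], epsilon, rng)
            next_state, reward, done = step(state, action)
            # Target = immediate reward plus discounted best value of the next state.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Policy: the highest-valued action in each non-terminal state."""
    names = ("left", "right")
    return [names[row.index(max(row))] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

In this sketch the only reward is given on reaching the goal cell, so the learned values encode the discounted long-term reward of each action rather than any immediate payoff, and the greedy policy is read off from them.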

Papers

Showing 7826-7850 of 15113 papers

| Title | Status | Hype |
| --- | --- | --- |
| MP3: Movement Primitive-Based (Re-)Planning Policy | | 0 |
| MPC4RL -- A Software Package for Reinforcement Learning based on Model Predictive Control | | 0 |
| MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles | | 0 |
| MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage | | 0 |
| MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning | | 0 |
| MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server | | 0 |
| MRAC-RL: A Framework for On-Line Policy Adaptation Under Parametric Model Uncertainty | | 0 |
| MSDF: A Deep Reinforcement Learning Framework for Service Function Chain Migration | | 0 |
| MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection | | 0 |
| MSRL: Distributed Reinforcement Learning with Dataflow Fragments | | 0 |
| MSVIPER: Improved Policy Distillation for Reinforcement-Learning-Based Robot Navigation | | 0 |
| MT^3: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning | | 0 |
| MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control | | 0 |
| MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale | | 0 |
| MULE: Multi-terrain and Unknown Load Adaptation for Effective Quadrupedal Locomotion | | 0 |
| Multi-Advisor Reinforcement Learning | | 0 |
| Multi-Agent Actor-Critic with Generative Cooperative Policy Network | | 0 |
| Multi-Agent Adversarial Attacks for Multi-Channel Communications | | 0 |
| Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network | | 0 |
| Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional Reasoning Approach | | 0 |
| Multiagent-based Participatory Urban Simulation through Inverse Reinforcement Learning | | 0 |
| Multi-agent Battery Storage Management using MPC-based Reinforcement Learning | | 0 |
| Multi-agent Bayesian Deep Reinforcement Learning for Microgrid Energy Management under Communication Failures | | 0 |
| Multi-Agent Broad Reinforcement Learning for Intelligent Traffic Light Control | | 0 |
| CGIBNet: Bandwidth-constrained Communication with Graph Information Bottleneck in Multi-Agent Reinforcement Learning | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |