SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 19711980 of 15113 papers

TitleStatusHype
Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space AlignmentCode1
FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning0
MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading0
Reinforcement Learning as a Robotics-Inspired Framework for Insect Navigation: From Spatial Representations to Neural Implementation0
When to Sense and Control? A Time-adaptive Approach for Continuous-Time RLCode0
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond0
Causal prompting model-based offline reinforcement learning0
NeoRL: Efficient Exploration for Nonepisodic RL0
A Fast Convergence Theory for Offline Decision Making0
Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems0
Show:102550
← PrevPage 198 of 1512Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified