SOTAVerified

General Reinforcement Learning

Papers

Showing 110 of 84 papers

TitleStatusHype
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning0
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
High-order Regularization for Machine Learning and Learning-based Control0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis0
Hypercube Policy Regularization Framework for Offline Reinforcement LearningCode0
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
Reinforcement Learning: Tutorial and Survey0
Dynamic Knowledge Injection for AIXI Agents0
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.6Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified