SOTAVerified

General Reinforcement Learning

Papers

Showing 1-25 of 84 papers

| Title | Status | Hype |
| --- | --- | --- |
| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | | 0 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | Code | 1 |
| High-order Regularization for Machine Learning and Learning-based Control | | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | | 0 |
| Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Code | 2 |
| The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Code | 0 |
| Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Code | 2 |
| Reinforcement Learning: Tutorial and Survey | | 0 |
| Dynamic Knowledge Injection for AIXI Agents | | 0 |
| Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods | | 0 |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Code | 2 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Code | 1 |
| Image Transformation Sequence Retrieval with General Reinforcement Learning | | 0 |
| L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning | | 0 |
| Computably Continuous Reinforcement-Learning Objectives are PAC-learnable | | 0 |
| Policy Mirror Descent Inherently Explores Action Space | | 0 |
| Learning to Backdoor Federated Learning | Code | 0 |
| Computational Dualism and Objective Superintelligence | | 0 |
| Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning | | 0 |
| AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning | | 0 |
| DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving | Code | 1 |
| Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural Networks | Code | 1 |
| Learning Deformable Object Manipulation from Expert Demonstrations | Code | 1 |
| Computable Artificial General Intelligence | | 0 |

Benchmark Results

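| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RNB | Score | 7 | | Unverified |
| 2 | PPO | Score | 5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RNB | Score | 4.8 | | Unverified |
| 2 | PPO | Score | 1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RNB | Score | 0.6 | | Unverified |
| 2 | PPO | Score | 0.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RNB | Score | 0.8 | | Unverified |
| 2 | PPO | Score | 0.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PPO | Score | 1.2 | | Unverified |
| 2 | RNB | Score | 1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RNB | Score | 3.4 | | Unverified |
| 2 | PPO | Score | 0.8 | | Unverified |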