SOTAVerified

General Reinforcement Learning

Papers

Showing 125 of 84 papers

TitleStatusHype
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
Stabilizing Transformers for Reinforcement LearningCode1
Time Limits in Reinforcement LearningCode1
Action Branching Architectures for Deep Reinforcement LearningCode1
Learning to Incentivize Other Learning AgentsCode1
Learning Exploration Policies for NavigationCode1
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement LearningCode1
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic FrameworkCode1
End-to-End Egospheric Spatial MemoryCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous DrivingCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Learning Deformable Object Manipulation from Expert DemonstrationsCode1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning AlgorithmCode1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignCode1
Computable Artificial General Intelligence0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore0.6Unverified
2RNBScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified