SOTAVerified

General Reinforcement Learning

Papers

Showing 125 of 84 papers

TitleStatusHype
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignCode1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous DrivingCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Learning Deformable Object Manipulation from Expert DemonstrationsCode1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
End-to-End Egospheric Spatial MemoryCode1
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement LearningCode1
Learning to Incentivize Other Learning AgentsCode1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic FrameworkCode1
Stabilizing Transformers for Reinforcement LearningCode1
Learning Exploration Policies for NavigationCode1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning AlgorithmCode1
Time Limits in Reinforcement LearningCode1
Action Branching Architectures for Deep Reinforcement LearningCode1
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.6Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified