SOTAVerified

General Reinforcement Learning

Papers

Showing 125 of 84 papers

TitleStatusHype
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
Stabilizing Transformers for Reinforcement LearningCode1
Time Limits in Reinforcement LearningCode1
Action Branching Architectures for Deep Reinforcement LearningCode1
End-to-End Egospheric Spatial MemoryCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Learning Deformable Object Manipulation from Expert DemonstrationsCode1
Learning to Incentivize Other Learning AgentsCode1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement LearningCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Learning Exploration Policies for NavigationCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic FrameworkCode1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous DrivingCode1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning AlgorithmCode1
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignCode1
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing fieldCode0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore0.6Unverified
2RNBScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified