General Reinforcement Learning

Papers

Showing 1–10 of 84 papers

Title	Date	Tasks	Status	Hype
OpenSpiel: A Framework for Reinforcement Learning in Games	Aug 26, 2019	General Reinforcement Learning, reinforcement-learning	Code Available	3
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning	Mar 31, 2025	General Reinforcement Learning, Instruction Following	Code Available	2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks	Oct 30, 2024	General Reinforcement Learning, Reinforcement Learning (RL)	Code Available	2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models	Oct 16, 2023	General Reinforcement Learning, GPU	Code Available	2
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning	May 21, 2025	General Reinforcement Learning, Logical Reasoning	Code Available	1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design	Oct 4, 2023	Deep Reinforcement Learning, General Reinforcement Learning	Code Available	1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving	Oct 29, 2022	Autonomous Driving, CARLA MAP Leaderboard	Code Available	1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural Networks	Oct 17, 2022	Autonomous Driving, Autonomous Vehicles	Code Available	1
Learning Deformable Object Manipulation from Expert Demonstrations	Jul 20, 2022	Deformable Object Manipulation, General Reinforcement Learning	Code Available	1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach	Nov 14, 2021	Algorithmic Trading, General Reinforcement Learning	Code Available	1

Benchmark Results

Obstacle Tower (No Gen) fixed

#	Model	Metric	Claimed	Verified	Status
1	RNB	Score	7	—	Unverified
2	PPO	Score	5	—	Unverified

Obstacle Tower (No Gen) varied

#	Model	Metric	Claimed	Verified	Status
1	RNB	Score	4.8	—	Unverified
2	PPO	Score	1	—	Unverified

Obstacle Tower (Strong Gen) fixed

#	Model	Metric	Claimed	Verified	Status
1	RNB	Score	0.6	—	Unverified
2	PPO	Score	0.6	—	Unverified

Obstacle Tower (Strong Gen) varied

#	Model	Metric	Claimed	Verified	Status
1	RNB	Score	0.8	—	Unverified
2	PPO	Score	0.6	—	Unverified

Obstacle Tower (Weak Gen) fixed

#	Model	Metric	Claimed	Verified	Status
1	PPO	Score	1.2	—	Unverified
2	RNB	Score	1	—	Unverified

Obstacle Tower (Weak Gen) varied

#	Model	Metric	Claimed	Verified	Status
1	RNB	Score	3.4	—	Unverified
2	PPO	Score	0.8	—	Unverified