| OpenSpiel: A Framework for Reinforcement Learning in Games | Aug 26, 2019 | General Reinforcement Learning, reinforcement-learning | Code Available | 3 |
| Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Mar 31, 2025 | General Reinforcement Learning, Instruction Following | Code Available | 2 |
| Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Oct 30, 2024 | General Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 2 |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Oct 16, 2023 | General Reinforcement Learning, GPU | Code Available | 2 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | May 21, 2025 | General Reinforcement Learning, Logical Reasoning | Code Available | 1 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement Learning, General Reinforcement Learning | Code Available | 1 |
| DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving | Oct 29, 2022 | Autonomous Driving, CARLA MAP Leaderboard | Code Available | 1 |
| Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural Networks | Oct 17, 2022 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 |
| Learning Deformable Object Manipulation from Expert Demonstrations | Jul 20, 2022 | Deformable Object Manipulation, General Reinforcement Learning | Code Available | 1 |
| Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach | Nov 14, 2021 | Algorithmic Trading, General Reinforcement Learning | Code Available | 1 |