| OpenSpiel: A Framework for Reinforcement Learning in Games | Aug 26, 2019 | General Reinforcement Learningreinforcement-learning | CodeCode Available | 3 | 5 |
| Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Mar 31, 2025 | General Reinforcement LearningInstruction Following | CodeCode Available | 2 | 5 |
| Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Oct 30, 2024 | General Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 2 | 5 |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Oct 16, 2023 | General Reinforcement LearningGPU | CodeCode Available | 2 | 5 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100kData Augmentation | CodeCode Available | 1 | 5 |
| DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving | Oct 29, 2022 | Autonomous DrivingCARLA MAP Leaderboard | CodeCode Available | 1 | 5 |
| End-to-End Egospheric Spatial Memory | Feb 15, 2021 | General Reinforcement LearningImitation Learning | CodeCode Available | 1 | 5 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 1 | 5 |
| Counterfactual Data Augmentation using Locally Factored Dynamics | Jul 6, 2020 | counterfactualData Augmentation | CodeCode Available | 1 | 5 |