| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | Jun 17, 2025 | General Reinforcement LearningMultimodal Reasoning | —Unverified | 0 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | May 21, 2025 | General Reinforcement LearningLogical Reasoning | CodeCode Available | 1 |
| High-order Regularization for Machine Learning and Learning-based Control | May 13, 2025 | General Reinforcement Learning | —Unverified | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Mar 31, 2025 | General Reinforcement LearningInstruction Following | CodeCode Available | 2 |
| The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | Dec 3, 2024 | General Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RLGeneral Reinforcement Learning | CodeCode Available | 0 |
| Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Oct 30, 2024 | General Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 2 |
| Reinforcement Learning: Tutorial and Survey | Jul 18, 2024 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |
| Dynamic Knowledge Injection for AIXI Agents | Dec 18, 2023 | General Reinforcement Learning | —Unverified | 0 |