| Learning Exploration Policies for Navigation | Mar 5, 2019 | Efficient ExplorationGeneral Reinforcement Learning | CodeCode Available | 1 |
| Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm | Dec 5, 2017 | Game of ChessGame of Go | CodeCode Available | 1 |
| Time Limits in Reinforcement Learning | Dec 1, 2017 | General Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 |
| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | Jun 17, 2025 | General Reinforcement LearningMultimodal Reasoning | —Unverified | 0 |
| High-order Regularization for Machine Learning and Learning-based Control | May 13, 2025 | General Reinforcement Learning | —Unverified | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | Dec 3, 2024 | General Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RLGeneral Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning: Tutorial and Survey | Jul 18, 2024 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |