| Low-Resource Machine Translation based on Asynchronous Dynamic Programming | Aug 1, 2021 | General Reinforcement LearningLow Resource Neural Machine Translation | —Unverified | 0 | 0 |
| L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning | May 23, 2023 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder | Mar 22, 2019 | DisentanglementGeneral Reinforcement Learning | —Unverified | 0 | 0 |
| Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | Oct 28, 2019 | General Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Nonparametric General Reinforcement Learning | Nov 28, 2016 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | Jun 17, 2025 | General Reinforcement LearningMultimodal Reasoning | —Unverified | 0 | 0 |
| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient ExplorationGeneral Reinforcement Learning | —Unverified | 0 | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision MakingGeneral Reinforcement Learning | —Unverified | 0 | 0 |
| Compositional Transfer in Hierarchical Reinforcement Learning | Jun 26, 2019 | General Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection | Nov 10, 2017 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |