| ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Feb 10, 2025 | Hierarchical Reinforcement Learning, Language Modeling | Code Available | 4 |
| SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending | Jun 11, 2025 | Hierarchical Reinforcement Learning, Humanoid Control | Code Available | 2 |
| Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning | May 26, 2025 | Decision Making, Hierarchical Reinforcement Learning | Code Available | 2 |
| Joint Admission Control and Resource Allocation of Virtual Network Embedding via Hierarchical Deep Reinforcement Learning | Jun 25, 2024 | Combinatorial Optimization, Graph Neural Network | Code Available | 2 |
| MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading | Jun 20, 2024 | Algorithmic Trading, Decision Making | Code Available | 2 |
| A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning | Dec 12, 2010 | Bayesian Optimization, Hierarchical Reinforcement Learning | Code Available | 2 |
| From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium | Jun 9, 2025 | Hierarchical Reinforcement Learning | Code Available | 1 |
| Multi-Turn Code Generation Through Single-Step Rewards | Feb 27, 2025 | Code Generation, Hierarchical Reinforcement Learning | Code Available | 1 |
| Item-Difficulty-Aware Learning Path Recommendation: From a Real Walking Perspective | Aug 24, 2024 | Hierarchical Reinforcement Learning | Code Available | 1 |
| Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration | Aug 3, 2024 | Hierarchical Reinforcement Learning, Knowledge Graphs | Code Available | 1 |