| ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Feb 10, 2025 | Hierarchical Reinforcement Learning, Language Modeling | Code Available | 4 |
| RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Feb 10, 2025 | Language Modeling | Code Available | 1 |
| Recent Advances in Discrete Speech Tokens: A Review | Feb 10, 2025 | Language Modeling | Unverified | 0 |
| Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation | Feb 10, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| Rationalization Models for Text-to-SQL | Feb 10, 2025 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Feb 10, 2025 | Diversity, Language Modeling | Code Available | 1 |
| μnit Scaling: Simple and Scalable FP8 LLM Training | Feb 9, 2025 | Language Modeling | Unverified | 0 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer Generation, Language Modeling | Code Available | 0 |
| Investigating Compositional Reasoning in Time Series Foundation Models | Feb 9, 2025 | Language Modeling | Unverified | 0 |
| Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform | Feb 9, 2025 | Decision Making, Language Modeling | Unverified | 0 |