| CoLA: Collaborative Low-Rank Adaptation | May 21, 2025 | CoLA, Mixture-of-Experts | Code Available | 0 |
| Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models | May 21, 2025 | All, CPU | Code Available | 0 |
| Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | May 21, 2025 | Chatbot, Instruction Following | Unverified | 0 |
| Time Tracker: Mixture-of-Experts-Enhanced Foundation Time Series Forecasting Model with Decoupled Training Pipelines | May 21, 2025 | Graph Learning, Mixture-of-Experts | Unverified | 0 |
| Efficient Data Driven Mixture-of-Expert Extraction from Trained Networks | May 21, 2025 | Mixture-of-Experts | Unverified | 0 |
| Multimodal Cultural Safety: Evaluation Frameworks and Alignment Strategies | May 20, 2025 | Mixture-of-Experts | Code Available | 0 |
| Towards Rehearsal-Free Continual Relation Extraction: Capturing Within-Task Variance with Adaptive Prompting | May 20, 2025 | Continual Relation Extraction, Mixture-of-Experts | Code Available | 0 |
| StPR: Spatiotemporal Preservation and Routing for Exemplar-Free Video Class-Incremental Learning | May 20, 2025 | class-incremental learning, Class Incremental Learning | Unverified | 0 |
| Multimodal Mixture of Low-Rank Experts for Sentiment Analysis and Emotion Recognition | May 20, 2025 | Emotion Recognition, Mixture-of-Experts | Unverified | 0 |
| THOR-MoE: Hierarchical Task-Guided and Context-Responsive Routing for Neural Machine Translation | May 20, 2025 | Machine Translation, Mixture-of-Experts | Unverified | 0 |