| M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration | Jun 9, 2025 | AllImage Restoration | —Unverified | 0 |
| MIRA: Medical Time Series Foundation Model for Real-World Health Data | Jun 9, 2025 | EthicsMissing Values | —Unverified | 0 |
| STAMImputer: Spatio-Temporal Attention MoE for Traffic Data Imputation | Jun 9, 2025 | Graph AttentionImputation | CodeCode Available | 0 |
| MoE-MLoRA for Multi-Domain CTR Prediction: Efficient Adaptation with Expert Specialization | Jun 9, 2025 | Click-Through Rate PredictionDiversity | CodeCode Available | 0 |
| MoE-GPS: Guidlines for Prediction Strategy for Dynamic Expert Duplication in MoE Load Balancing | Jun 9, 2025 | GPUMixture-of-Experts | —Unverified | 0 |
| Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning | Jun 7, 2025 | Continual LearningFederated Learning | —Unverified | 0 |
| SMAR: Soft Modality-Aware Routing Strategy for MoE-based Multimodal Large Language Models Preserving Language Capabilities | Jun 6, 2025 | Mixture-of-Experts | —Unverified | 0 |
| Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection | Jun 5, 2025 | Fake News Detectionknowledge editing | —Unverified | 0 |
| FlashDMoE: Fast Distributed MoE in a Single Kernel | Jun 5, 2025 | 16kCPU | CodeCode Available | 3 |
| Brain-Like Processing Pathways Form in Models With Heterogeneous Experts | Jun 3, 2025 | FormMixture-of-Experts | —Unverified | 0 |