| Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning | Jun 26, 2025 | Continual LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Llama 3 Meets MoE: Efficient Upcycling | Dec 13, 2024 | Mixture-of-ExpertsMMLU | —Unverified | 0 | 0 |
| LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models | Mar 27, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs | Jan 29, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| LLM4WM: Adapting LLM for Wireless Multi-Tasking | Jan 22, 2025 | General KnowledgeLanguage Modeling | —Unverified | 0 | 0 |
| LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Jan 16, 2025 | Mixture-of-ExpertsWorld Knowledge | —Unverified | 0 | 0 |
| Load Balancing Mixture of Experts with Similarity Preserving Routers | Jun 16, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Locking and Quacking: Stacking Bayesian model predictions by log-pooling and superposition | May 12, 2023 | Bayesian InferenceMixture-of-Experts | —Unverified | 0 | 0 |
| LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing | Jun 17, 2025 | ARCCoLA | —Unverified | 0 | 0 |
| LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design | May 28, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training | May 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation | Jun 19, 2024 | Continual LearningImage Segmentation | —Unverified | 0 | 0 |
| LPT++: Efficient Training on Mixture of Long-tailed Experts | Sep 17, 2024 | Mixture-of-Expertsparameter-efficient fine-tuning | —Unverified | 0 | 0 |
| LSTM-based Mixture-of-Experts for Knowledge-Aware Dialogues | May 5, 2016 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection | Nov 13, 2024 | Code GenerationMathematical Reasoning | —Unverified | 0 | 0 |
| M^2CD: A Unified MultiModal Framework for Optical-SAR Change Detection with Mixture of Experts and Self-Distillation | Mar 25, 2025 | Change DetectionDisaster Response | —Unverified | 0 | 0 |
| M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference | Feb 4, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration | Jun 9, 2025 | AllImage Restoration | —Unverified | 0 | 0 |
| M^3TN: Multi-gate Mixture-of-Experts based Multi-valued Treatment Network for Uplift Modeling | Jan 24, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | Nov 16, 2021 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Machine learning based digital twin for dynamical systems with multiple time-scales | May 12, 2020 | BIG-bench Machine LearningMixture-of-Experts | —Unverified | 0 | 0 |
| MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning | Oct 30, 2024 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model | Mar 8, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts | May 16, 2024 | Dialogue State TrackingMixture-of-Experts | —Unverified | 0 | 0 |
| Massively Multilingual Shallow Fusion with Large Language Models | Feb 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |