| KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation | May 10, 2021 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 | 0 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Oct 28, 2022 | Common Sense ReasoningCoreference Resolution | —Unverified | 0 | 0 |
| LaDiMo: Layer-wise Distillation Inspired MoEfier | Aug 8, 2024 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 | 0 |
| Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping | Apr 30, 2024 | AllMixture-of-Experts | —Unverified | 0 | 0 |
| Language-driven All-in-one Adverse Weather Removal | Dec 3, 2023 | AllDiversity | —Unverified | 0 | 0 |
| Large-Scale YouTube-8M Video Understanding with Deep Neural Networks | Jun 14, 2017 | ClassificationGeneral Classification | —Unverified | 0 | 0 |
| La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection | Aug 23, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| LaVIDE: A Language-Vision Discriminator for Detecting Changes in Satellite Image with Map References | Nov 29, 2024 | Change DetectionMixture-of-Experts | —Unverified | 0 | 0 |
| Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | Jul 5, 2024 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy | Nov 25, 2024 | Mixture-of-Expertsregression | —Unverified | 0 | 0 |
| Learning Factored Representations in a Deep Mixture of Experts | Dec 16, 2013 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images | Jan 1, 2025 | Mixture-of-Expertswhole slide images | —Unverified | 0 | 0 |
| Learning in Gated Neural Networks | Jun 6, 2019 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Learning Large-scale Universal User Representation with Sparse Mixture of Experts | Jul 11, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Learning More Generalized Experts by Merging Experts in Mixture-of-Experts | May 19, 2024 | Incremental LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Learning Sparse Mixture of Experts for Visual Question Answering | Sep 19, 2019 | Mixture-of-ExpertsQuestion Answering | —Unverified | 0 | 0 |
| Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning | Nov 7, 2018 | Mixture-of-ExpertsVideo Captioning | —Unverified | 0 | 0 |
| Learning to Ground VLMs without Forgetting | Oct 14, 2024 | DecoderLanguage Modelling | —Unverified | 0 | 0 |
| LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Jun 28, 2024 | Mixture-of-ExpertsModel Editing | —Unverified | 0 | 0 |
| Leveraging Mixture of Experts for Improved Speech Deepfake Detection | Sep 24, 2024 | DeepFake DetectionFace Swapping | —Unverified | 0 | 0 |
| Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Leveraging Pre-Trained Models for Multimodal Class-Incremental Learning under Adaptive Fusion | Feb 7, 2025 | class-incremental learningClass Incremental Learning | —Unverified | 0 | 0 |
| Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection | Jun 5, 2025 | Fake News Detectionknowledge editing | —Unverified | 0 | 0 |
| Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts | Nov 23, 2024 | knowledge editingMixture-of-Experts | —Unverified | 0 | 0 |
| Lifelong Language Pretraining with Distribution-Specialized Experts | May 20, 2023 | Lifelong learningMixture-of-Experts | —Unverified | 0 | 0 |
| Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning | Jun 26, 2025 | Continual LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Llama 3 Meets MoE: Efficient Upcycling | Dec 13, 2024 | Mixture-of-ExpertsMMLU | —Unverified | 0 | 0 |
| LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models | Mar 27, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs | Jan 29, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| LLM4WM: Adapting LLM for Wireless Multi-Tasking | Jan 22, 2025 | General KnowledgeLanguage Modeling | —Unverified | 0 | 0 |
| LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Jan 16, 2025 | Mixture-of-ExpertsWorld Knowledge | —Unverified | 0 | 0 |
| Load Balancing Mixture of Experts with Similarity Preserving Routers | Jun 16, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Locking and Quacking: Stacking Bayesian model predictions by log-pooling and superposition | May 12, 2023 | Bayesian InferenceMixture-of-Experts | —Unverified | 0 | 0 |
| LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing | Jun 17, 2025 | ARCCoLA | —Unverified | 0 | 0 |
| LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design | May 28, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training | May 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation | Jun 19, 2024 | Continual LearningImage Segmentation | —Unverified | 0 | 0 |
| LPT++: Efficient Training on Mixture of Long-tailed Experts | Sep 17, 2024 | Mixture-of-Expertsparameter-efficient fine-tuning | —Unverified | 0 | 0 |
| LSTM-based Mixture-of-Experts for Knowledge-Aware Dialogues | May 5, 2016 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection | Nov 13, 2024 | Code GenerationMathematical Reasoning | —Unverified | 0 | 0 |
| M^2CD: A Unified MultiModal Framework for Optical-SAR Change Detection with Mixture of Experts and Self-Distillation | Mar 25, 2025 | Change DetectionDisaster Response | —Unverified | 0 | 0 |
| M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference | Feb 4, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration | Jun 9, 2025 | AllImage Restoration | —Unverified | 0 | 0 |
| M^3TN: Multi-gate Mixture-of-Experts based Multi-valued Treatment Network for Uplift Modeling | Jan 24, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | Nov 16, 2021 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Machine learning based digital twin for dynamical systems with multiple time-scales | May 12, 2020 | BIG-bench Machine LearningMixture-of-Experts | —Unverified | 0 | 0 |
| MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning | Oct 30, 2024 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model | Mar 8, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts | May 16, 2024 | Dialogue State TrackingMixture-of-Experts | —Unverified | 0 | 0 |
| Massively Multilingual Shallow Fusion with Large Language Models | Feb 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |