SOTAVerified

Mixture-of-Experts

Papers

Showing 1151–1200 of 1312 papers

Title (every entry on this page has an empty Status and a Hype score of 0)

KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models
LaDiMo: Layer-wise Distillation Inspired MoEfier
Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping
Language-driven All-in-one Adverse Weather Removal
Large-Scale YouTube-8M Video Understanding with Deep Neural Networks
La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection
LaVIDE: A Language-Vision Discriminator for Detecting Changes in Satellite Image with Map References
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement
LDACP: Long-Delayed Ad Conversions Prediction Model for Bidding Strategy
Learning Factored Representations in a Deep Mixture of Experts
Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images
Learning in Gated Neural Networks
Learning Large-scale Universal User Representation with Sparse Mixture of Experts
Learning More Generalized Experts by Merging Experts in Mixture-of-Experts
Learning Sparse Mixture of Experts for Visual Question Answering
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning
Learning to Ground VLMs without Forgetting
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models
Leveraging Mixture of Experts for Improved Speech Deepfake Detection
Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication
Leveraging Pre-Trained Models for Multimodal Class-Incremental Learning under Adaptive Fusion
Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
Lifelong Language Pretraining with Distribution-Specialized Experts
Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning
Llama 3 Meets MoE: Efficient Upcycling
LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs
LLM4WM: Adapting LLM for Wireless Multi-Tasking
LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading
Load Balancing Mixture of Experts with Similarity Preserving Routers
Locking and Quacking: Stacking Bayesian model predictions by log-pooling and superposition
LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training
Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation
LPT++: Efficient Training on Mixture of Long-tailed Experts
LSTM-based Mixture-of-Experts for Knowledge-Aware Dialogues
Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection
M^2CD: A Unified MultiModal Framework for Optical-SAR Change Detection with Mixture of Experts and Self-Distillation
M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference
M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration
M^3TN: Multi-gate Mixture-of-Experts based Multi-valued Treatment Network for Uplift Modeling
M6-T: Exploring Sparse Expert Models and Beyond
Machine learning based digital twin for dynamical systems with multiple time-scales
MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning
MANDARIN: Mixture-of-Experts Framework for Dynamic Delirium and Coma Prediction in ICU Patients: Development and Validation of an Acute Brain Dysfunction Prediction Model
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts
Massively Multilingual Shallow Fusion with Large Language Models
Page 24 of 27
