SOTAVerified

Mixture-of-Experts

Papers

Showing 226–250 of 1312 papers

Title | Status | Hype
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design | Code | 1
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach | Code | 1
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Code | 1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Code | 1
Distilling the Knowledge in a Neural Network | Code | 1
DirectMultiStep: Direct Route Generation for Multi-Step Retrosynthesis | Code | 1
A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction | Code | 1
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts | Code | 1
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment | Code | 1
Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts | Code | 1
Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation | Code | 1
BrainMAP: Learning Multiple Activation Pathways in Brain Networks | Code | 1
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks | Code | 1
Heterogeneous Multi-task Learning with Expert Diversity | Code | 1
Graph Sparsification via Mixture of Graphs | Code | 1
Gradient-free variational learning with conditional mixture networks | Code | 1
GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts | Code | 1
Heterogeneous Mixture of Experts for Remote Sensing Image Super-Resolution | Code | 1
HydraSum: Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Code | 1
Gated Multimodal Units for Information Fusion | Code | 1
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Code | 1
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Code | 1
BiMediX: Bilingual Medical Mixture of Experts LLM | Code | 1
FreqMoE: Enhancing Time Series Forecasting through Frequency Decomposition Mixture of Experts | Code | 1
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts | Code | 1
Page 10 of 53

No leaderboard results yet.