
Mixture-of-Experts

Papers

Showing 201–225 of 1312 papers

| Title | Status | Hype |
| --- | --- | --- |
| MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering | Code | 1 |
| Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference | Code | 1 |
| Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark | Code | 1 |
| Addressing Confounding Feature Issue for Causal Recommendation | Code | 1 |
| C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing | Code | 1 |
| EWMoE: An effective model for global weather forecasting with mixture-of-experts | Code | 1 |
| MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators | Code | 1 |
| Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts | Code | 1 |
| Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts | Code | 1 |
| HydraSum: Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Code | 1 |
| Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification | Code | 1 |
| XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection | Code | 1 |
| Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding | Code | 1 |
| Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Code | 1 |
| Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf Node | Code | 1 |
| MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models | Code | 1 |
| Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy | Code | 1 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Code | 1 |
| Merging Experts into One: Improving Computational Efficiency of Mixture of Experts | Code | 1 |
| Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters | Code | 1 |
| Emergent Modularity in Pre-trained Transformers | Code | 1 |
| Merging Multi-Task Models via Weight-Ensembling Mixture of Experts | Code | 1 |
| MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering | Code | 1 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Code | 1 |
| MoCaE: Mixture of Calibrated Experts Significantly Improves Object Detection | Code | 1 |
Page 9 of 53

No leaderboard results yet.