Mixture-of-Experts

Papers

Showing 301–325 of 1312 papers

Title	Date	Tasks	Status	Hype
DirectMultiStep: Direct Route Generation for Multi-Step Retrosynthesis	May 22, 2024	Diversity, Mixture-of-Experts	Code Available	1
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding	May 10, 2025	Descriptive, Emotion Recognition	Code Available	1
Frequency-Adaptive Pan-Sharpening with Mixture of Experts	Jan 4, 2024	Mixture-of-Experts	Code Available	1
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts	May 30, 2023	CPU, GPU	Code Available	1
Heterogeneous Mixture of Experts for Remote Sensing Image Super-Resolution	Feb 12, 2025	Image Super-Resolution, Mixture-of-Experts	Code Available	1
FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	May 26, 2025	Mixture-of-Experts	Code Available	1
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation	Oct 14, 2022	CPU, Machine Translation	Code Available	1
FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing	Dec 22, 2023	Mixture-of-Experts, Motion Generation	Code Available	1
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy	Oct 2, 2023	Mixture-of-Experts	Code Available	1
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts	Oct 15, 2023	Computational Efficiency, Mixture-of-Experts	Code Available	1
MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models	May 19, 2024	Mixture-of-Experts, parameter-efficient fine-tuning	Code Available	1
Efficient Dictionary Learning with Switch Sparse Autoencoders	Oct 10, 2024	Dictionary Learning, Mixture-of-Experts	Code Available	1
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs	Jul 1, 2024	GPU, Mixture-of-Experts	Code Available	1
Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters	Feb 1, 2024	Mixture-of-Experts, parameter-efficient fine-tuning	Code Available	1
Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf Node	May 27, 2024	Computational Efficiency, Mixture-of-Experts	Code Available	1
Specialized federated learning using a mixture of experts	Oct 5, 2020	Federated Learning, Mixture-of-Experts	Code Available	1
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate	Dec 29, 2021	Language Modeling, Language Modelling	Code Available	1
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts	Apr 16, 2025	Mixture-of-Experts	Code Available	1
Emergent Modularity in Pre-trained Transformers	May 28, 2023	Mixture-of-Experts	Code Available	1
Few-Shot and Continual Learning with Attentive Independent Mechanisms	Jul 29, 2021	Continual Learning, Few-Shot Learning	Code Available	1
FreqMoE: Enhancing Time Series Forecasting through Frequency Decomposition Mixture of Experts	Jan 25, 2025	Mixture-of-Experts, Prediction	Code Available	1
Heterogeneous Multi-task Learning with Expert Diversity	Jun 20, 2021	Diversity, Mixture-of-Experts	Code Available	1
Learning to Skip the Middle Layers of Transformers	Jun 26, 2025	Mixture-of-Experts	Code Available	1
Modality Interactive Mixture-of-Experts for Fake News Detection	Jan 21, 2025	Fake News Detection, Misinformation	Code Available	1
Mixture of Experts Meets Prompt-Based Continual Learning	May 23, 2024	Continual Learning, Mixture-of-Experts	Code Available	1

