Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 276–300 of 1312 papers

Title	Date	Tasks	Status	Hype	Score
Multimodal Clinical Trial Outcome Prediction with Large Language Models	Feb 9, 2024	Mixture-of-ExpertsPrediction	CodeCode Available	1	5
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference	Jan 16, 2024	GPUMixture-of-Experts	CodeCode Available	1	5
Multi-Head Mixture-of-Experts	Apr 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts	Jan 18, 2021	AllMixture-of-Experts	CodeCode Available	1	5
EWMoE: An effective model for global weather forecasting with mixture-of-experts	May 9, 2024	Mixture-of-ExpertsWeather Forecasting	CodeCode Available	1	5
DMT-HI: MOE-based Hyperbolic Interpretable Deep Manifold Transformation for Unspervised Dimensionality Reduction	Oct 25, 2024	Dimensionality ReductionMixture-of-Experts	CodeCode Available	1	5
Specialized federated learning using a mixture of experts	Oct 5, 2020	Federated LearningMixture-of-Experts	CodeCode Available	1	5
Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark	Jun 12, 2024	BenchmarkingMixture-of-Experts	CodeCode Available	1	5
FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	May 26, 2025	Mixture-of-Experts	CodeCode Available	1	5
MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts	Oct 18, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing	Dec 22, 2023	Mixture-of-ExpertsMotion Generation	CodeCode Available	1	5
Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification	Apr 21, 2025	Exemplar-FreeKnowledge Distillation	CodeCode Available	1	5
XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection	Feb 27, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
Distilling the Knowledge in a Neural Network	Mar 9, 2015	Knowledge DistillationMixture-of-Experts	CodeCode Available	1	5
Emergent Modularity in Pre-trained Transformers	May 28, 2023	Mixture-of-Experts	CodeCode Available	1	5
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts	Nov 16, 2024	Mixture-of-ExpertsOptical Character Recognition (OCR)	CodeCode Available	1	5
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding	May 10, 2025	DescriptiveEmotion Recognition	CodeCode Available	1	5
MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks	Jan 21, 2025	iFunMixture-of-Experts	CodeCode Available	1	5
Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts	Jun 17, 2024	Mixture-of-Experts	CodeCode Available	1	5
Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing	Jul 26, 2024	AttributeLanguage Modelling	CodeCode Available	1	5
MoËT: Mixture of Expert Trees and its Application to Verifiable Reinforcement Learning	Jun 16, 2019	Game of GoImitation Learning	CodeCode Available	1	5
DirectMultiStep: Direct Route Generation for Multi-Step Retrosynthesis	May 22, 2024	DiversityMixture-of-Experts	CodeCode Available	1	5
Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf Node	May 27, 2024	Computational EfficiencyMixture-of-Experts	CodeCode Available	1	5
MoExtend: Tuning New Experts for Modality and Task Extension	Aug 7, 2024	Mixture-of-Experts	CodeCode Available	1	5
Efficient Dictionary Learning with Switch Sparse Autoencoders	Oct 10, 2024	Dictionary LearningMixture-of-Experts	CodeCode Available	1	5

Show:10 25 50

← PrevPage 12 of 53Next →

No leaderboard results yet.