| Paper | Date | Tasks | Code | ★ |
| --- | --- | --- | --- | --- |
| MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | May 2, 2024 | Combinatorial Optimization, Mixture-of-Experts | Code Available | 3 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense Reasoning, GPU | Code Available | 3 |
| Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters | Mar 18, 2024 | Continual Learning, Incremental Learning | Code Available | 3 |
| MoAI: Mixture of All Intelligence for Large Language and Vision Models | Mar 12, 2024 | Mixture-of-Experts | Code Available | 3 |
| Scaling Laws for Fine-Grained Mixture of Experts | Feb 12, 2024 | Mixture-of-Experts | Code Available | 3 |
| Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Feb 10, 2024 | CPU, GPU | Code Available | 3 |
| BlackMamba: Mixture of Experts for State-Space Models | Feb 1, 2024 | Language Modeling | Code Available | 3 |
| MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts | Jan 8, 2024 | Mamba, Mixture-of-Experts | Code Available | 3 |
| SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling | Dec 23, 2023 | Instruction Following, Language Modeling | Code Available | 3 |
| MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Nov 29, 2022 | GPU, Mixture-of-Experts | Code Available | 3 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | Feb 17, 2022 | ARC, Common Sense Reasoning | Code Available | 3 |
| MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models | Jul 9, 2025 | Mixture-of-Experts, Time Series | Code Available | 2 |
| Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts | Jul 7, 2025 | Inductive Bias, Mixture-of-Experts | Code Available | 2 |
| WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference | May 26, 2025 | Language Modeling | Code Available | 2 |
| I2MoE: Interpretable Multimodal Interaction-aware Mixture-of-Experts | May 25, 2025 | Mixture-of-Experts, Multimodal Interaction | Code Available | 2 |
| HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Apr 8, 2025 | CPU, GPU | Code Available | 2 |
| Mixture of Lookup Experts | Mar 20, 2025 | Mixture-of-Experts | Code Available | 2 |
| Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts | Mar 7, 2025 | Mixture-of-Experts, State Space Models | Code Available | 2 |
| Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment | Feb 24, 2025 | Image Classification | Code Available | 2 |
| Delta Decompression for MoE-based LLMs Compression | Feb 24, 2025 | Diversity, Mixture-of-Experts | Code Available | 2 |
| LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes | Jan 7, 2025 | Mixture-of-Experts, Representation Learning | Code Available | 2 |
| Superposition in Transformers: A Novel Way of Building Mixture of Experts | Dec 31, 2024 | Mixture-of-Experts | Code Available | 2 |
| ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing | Dec 19, 2024 | Mixture-of-Experts | Code Available | 2 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Dec 14, 2024 | Mixture-of-Experts, Object Re-Identification | Code Available | 2 |
| Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine | Dec 12, 2024 | Language Modeling | Code Available | 2 |