SOTAVerified

Mixture-of-Experts

Papers

Showing 5175 of 1312 papers

TitleStatusHype
ST-MoE: Designing Stable and Transferable Sparse Expert ModelsCode3
Reservoir History Matching of the Norne field with generative exotic priors and a coupled Mixture of Experts -- Physics Informed Neural Operator Forward ModelCode3
Generalizing Motion Planners with Mixture of Experts for Autonomous DrivingCode3
MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-ExpertsCode3
A Survey on Mixture of ExpertsCode3
FlashDMoE: Fast Distributed MoE in a Single KernelCode3
AnyGraph: Graph Foundation Model in the WildCode3
A Survey on Inference Optimization Techniques for Mixture of Experts ModelsCode3
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts ModelsCode3
MoE-Mamba: Efficient Selective State Space Models with Mixture of ExpertsCode3
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-ScalingCode3
ModuleFormer: Modularity Emerges from Mixture-of-ExpertsCode2
Mixture of Lookup ExpertsCode2
Mixture of A Million ExpertsCode2
Mixture of Tokens: Continuous MoE through Cross-Example AggregationCode2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family ExpertsCode2
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous DrivingCode2
MoE-FFD: Mixture of Experts for Generalized and Parameter-Efficient Face Forgery DetectionCode2
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization AlignmentCode2
LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image RestorationCode2
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains MoreCode2
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-TrainingCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
MDFEND: Multi-domain Fake News DetectionCode2
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer ModelsCode2
Show:102550
← PrevPage 3 of 53Next →

No leaderboard results yet.