SOTAVerified

Mixture-of-Experts

Papers

Showing 76100 of 1312 papers

TitleStatusHype
Demystifying the Compression of Mixture-of-Experts Through a Unified FrameworkCode2
MoEUT: Mixture-of-Experts Universal TransformersCode2
MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting ModelsCode2
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU RoutingCode2
MoE-FFD: Mixture of Experts for Generalized and Parameter-Efficient Face Forgery DetectionCode2
ModuleFormer: Modularity Emerges from Mixture-of-ExpertsCode2
MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision TasksCode2
Mixture of Lookup ExpertsCode2
Mixture of A Million ExpertsCode2
Mixture of Tokens: Continuous MoE through Cross-Example AggregationCode2
Fast Feedforward NetworksCode2
CNMBERT: A Model for Converting Hanyu Pinyin Abbreviations to Chinese CharactersCode2
MDFEND: Multi-domain Fake News DetectionCode2
MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains MoreCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization AlignmentCode2
Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-ExpertsCode2
MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous DrivingCode2
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-ExpertsCode2
LiMoE: Mixture of LiDAR Representation Learners from Automotive ScenesCode2
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-ExpertsCode2
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-TrainingCode2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family ExpertsCode2
Learning Robust Stereo Matching in the Wild with Selective Mixture-of-ExpertsCode2
A Closer Look into Mixture-of-Experts in Large Language ModelsCode2
Show:102550
← PrevPage 4 of 53Next →

No leaderboard results yet.