SOTAVerified

Mixture-of-Experts

Papers

Showing 576–600 of 1312 papers

Title | Status | Hype
Multi-Treatment Multi-Task Uplift Modeling for Enhancing User Growth | - | 0
DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation | Code | 0
SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging | - | 0
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale | Code | 5
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators | Code | 0
MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing | Code | 0
FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts | - | 0
KAN4TSF: Are KAN and KAN-based models Effective for Time Series Forecasting? | Code | 2
HMoE: Heterogeneous Mixture of Experts for Language Modeling | - | 0
Navigating Spatio-Temporal Heterogeneity: A Graph Transformer Approach for Traffic Forecasting | Code | 1
AnyGraph: Graph Foundation Model in the Wild | Code | 3
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference | Code | 1
A Unified Framework for Iris Anti-Spoofing: Introducing IrisGeneral Dataset and Masked-MoE Method | - | 0
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Code | 1
FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Code | 0
Integrating Multi-view Analysis: Multi-view Mixture-of-Expert for Textual Personality Detection | Code | 0
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models | Code | 0
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts | - | 0
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning | - | 0
AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies | Code | 1
Layerwise Recurrent Router for Mixture-of-Experts | Code | 1
HoME: Hierarchy of Multi-Gate Experts for Multi-Task Learning at Kuaishou | - | 0
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning | Code | 0
LaDiMo: Layer-wise Distillation Inspired MoEfier | - | 0
MoC-System: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training | - | 0