| Functional mixture-of-experts for classification | Feb 28, 2022 | ClassificationMixture-of-Experts | —Unverified | 0 |
| FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion | Feb 5, 2024 | Missing ElementsMixture-of-Experts | —Unverified | 0 |
| FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | May 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations | Apr 8, 2025 | Instruction FollowingMixture-of-Experts | —Unverified | 0 |
| Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models | Jun 5, 2024 | Mixture-of-ExpertsTime Series | —Unverified | 0 |
| A Review of DeepSeek Models' Key Innovative Techniques | Mar 14, 2025 | Mixture-of-Expertsreinforcement-learning | —Unverified | 0 |
| AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts | Jan 1, 2023 | Instance SegmentationMixture-of-Experts | —Unverified | 0 |
| Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers | May 28, 2022 | Machine TranslationMixture-of-Experts | —Unverified | 0 |
| Imitation Learning from MPC for Quadrupedal Multi-Gait Control | Mar 26, 2021 | Imitation LearningMixture-of-Experts | —Unverified | 0 |
| Coordination with Humans via Strategy Matching | Oct 27, 2022 | Mixture-of-Experts | —Unverified | 0 |
| GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input | Jun 1, 2021 | Mixture-of-Expertsnamed-entity-recognition | —Unverified | 0 |
| Generalizable Person Re-identification with Relevance-aware Mixture of Experts | May 19, 2021 | Generalizable Person Re-identificationMixture-of-Experts | —Unverified | 0 |
| Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study | Mar 26, 2024 | Learning TheoryMixture-of-Experts | —Unverified | 0 |
| Improved Training of Mixture-of-Experts Language GANs | Feb 23, 2023 | Adversarial TextImage Generation | —Unverified | 0 |
| Affect in Tweets Using Experts Model | Mar 20, 2019 | Mixture-of-Expertsmodel | —Unverified | 0 |
| Generator Assisted Mixture of Experts For Feature Acquisition in Batch | Dec 19, 2023 | Mixture-of-Experts | —Unverified | 0 |
| GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot | Mar 20, 2024 | Mixture-of-ExpertsMulti-Task Learning | —Unverified | 0 |
| Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings | Jun 14, 2023 | DiversityFederated Learning | —Unverified | 0 |
| GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture | Jun 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GLA in MediaEval 2018 Emotional Impact of Movies Task | Nov 27, 2019 | Mixture-of-Experts | —Unverified | 0 |
| GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | Dec 13, 2021 | Common Sense ReasoningIn-Context Learning | —Unverified | 0 |
| FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts | Aug 21, 2024 | Federated LearningHeuristic Search | —Unverified | 0 |
| IDEA: An Inverse Domain Expert Adaptation Based Active DNN IP Protection Method | Sep 29, 2024 | Domain AdaptationMixture-of-Experts | —Unverified | 0 |
| FedMoE-DA: Federated Mixture of Experts via Domain Aware Fine-grained Aggregation | Nov 4, 2024 | Federated LearningMixture-of-Experts | —Unverified | 0 |
| Hypertext Entity Extraction in Webpage | Mar 4, 2024 | Mixture-of-Experts | —Unverified | 0 |