| Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity | Jan 11, 2021 | Language Modelling, Mixture-of-Experts | Code Available | 2 |
| Federated learning using mixture of experts | Jan 1, 2021 | Federated Learning, Mixture-of-Experts | Unverified | 0 |
| Exploring Routing Strategies for Multilingual Mixture-of-Experts Models | Jan 1, 2021 | Decoder, Mixture-of-Experts | Unverified | 0 |
| Gated Ensemble of Spatio-temporal Mixture of Experts for Multi-task Learning in Ride-hailing System | Dec 31, 2020 | Mixture-of-Experts, Multi-Task Learning | Unverified | 0 |
| PFL-MoE: Personalized Federated Learning Based on Mixture of Experts | Dec 31, 2020 | Decision Making, Federated Learning | Code Available | 1 |
| Self-Supervised Multimodal Domino: in Search of Biomarkers for Alzheimer's Disease | Dec 25, 2020 | Contrastive Learning, Decoder | Code Available | 0 |
| Channel Gain Cartography via Mixture of Experts | Dec 8, 2020 | Mixture-of-Experts | Unverified | 0 |
| A similarity-based Bayesian mixture-of-experts model | Dec 3, 2020 | Mixture-of-Experts, model | Unverified | 0 |
| A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings | Dec 1, 2020 | Entity Embeddings, Mixture-of-Experts | Code Available | 0 |
| Neural Transduction for Multilingual Lexical Translation | Dec 1, 2020 | Mixture-of-Experts, Translation | Unverified | 0 |