| Unbiased Gradient Estimation with Balanced Assignments for Mixtures of Experts | Sep 24, 2021 | Mixture-of-Experts | —Unverified | 0 |
| Scalable and Efficient MoE Training for Multitask Multilingual Models | Sep 22, 2021 | Machine TranslationMixture-of-Experts | —Unverified | 0 |
| Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy | Sep 11, 2021 | Machine TranslationMixture-of-Experts | CodeCode Available | 0 |
| Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Sep 9, 2021 | Mixture-of-ExpertsRetrieval | CodeCode Available | 1 |
| Cross-token Modeling with Conditional Computation | Sep 5, 2021 | Computational EfficiencyImage Classification | —Unverified | 0 |
| Personalised Federated Learning: A Combinational Approach | Aug 22, 2021 | Federated LearningKnowledge Distillation | —Unverified | 0 |
| SPMoE: Generate Multiple Pattern-Aware Outputs with Sparse Pattern Mixture of Experts | Aug 17, 2021 | DiversityMixture-of-Experts | —Unverified | 0 |
| AIREX: Neural Network-based Approach for Air Quality Inference in Unmonitored Cities | Aug 16, 2021 | Air Quality InferenceMixture-of-Experts | —Unverified | 0 |
| Strength in Numbers: Averaging and Clustering Effects in Mixture of Experts for Graph-Based Dependency Parsing | Aug 1, 2021 | ClusteringDependency Parsing | —Unverified | 0 |
| A Mixture-of-Experts Model for Antonym-Synonym Discrimination | Aug 1, 2021 | Mixture-of-Experts | CodeCode Available | 0 |