| One Student Knows All Experts Know: From Sparse to Dense | Jan 26, 2022 | All, Knowledge Distillation | Unverified | 0 |
| MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation | Jan 16, 2022 | Knowledge Distillation, Mixture-of-Experts | Unverified | 0 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | Jan 16, 2022 | Mixture-of-Experts, Multi-Task Learning | Unverified | 0 |
| DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale | Jan 14, 2022 | Decoder, Mixture-of-Experts | Code Available | 0 |
| Towards Lightweight Neural Animation: Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models | Jan 11, 2022 | Mixture-of-Experts, Network Pruning | Unverified | 0 |
| Combinations of Adaptive Filters | Dec 22, 2021 | Mixture-of-Experts | Unverified | 0 |
| Efficient Large Scale Language Modeling with Mixtures of Experts | Dec 20, 2021 | Language Modeling, Language Modelling | Unverified | 0 |
| GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | Dec 13, 2021 | Common Sense Reasoning, In-Context Learning | Unverified | 0 |
| Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition | Dec 10, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Specializing Versatile Skill Libraries using Local Mixture of Experts | Dec 8, 2021 | Incremental Learning, Mixture-of-Experts | Code Available | 0 |
| Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings | Dec 6, 2021 | Drug Discovery, GPU | Unverified | 0 |
| A Mixture of Expert Based Deep Neural Network for Improved ASR | Dec 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification | Nov 29, 2021 | Domain Generalization, Generalizable Person Re-identification | Unverified | 0 |
| Expert Aggregation for Financial Forecasting | Nov 25, 2021 | BIG-bench Machine Learning, Mixture-of-Experts | Unverified | 0 |
| SpeechMoE2: Mixture-of-Experts Model with Improved Routing | Nov 23, 2021 | Computational Efficiency, Mixture-of-Experts | Unverified | 0 |
| Table-based Fact Verification with Self-adaptive Mixture of Experts | Nov 16, 2021 | Fact Verification, Logical Reasoning | Unverified | 0 |
| MoEfication: Conditional Computation of Transformer Models for Efficient Inference | Nov 16, 2021 | Mixture-of-Experts | Unverified | 0 |
| StableMoE: Stable Routing Strategy for Mixture of Experts | Nov 16, 2021 | Language Modeling, Language Modelling | Unverified | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | Nov 16, 2021 | Mixture-of-Experts | Unverified | 0 |
| SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | Nov 16, 2021 | Abstractive Text Summarization, Mixture-of-Experts | Unverified | 0 |
| Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern Estimation | Nov 2, 2021 | Mixture-of-Experts | Code Available | 0 |
| RTM Super Learner Results at Quality Estimation Task | Nov 1, 2021 | Mixture-of-Experts, Translation | Unverified | 0 |
| Polynomial-Spline Neural Networks with Exact Integrals | Oct 26, 2021 | Mixture-of-Experts, regression | Unverified | 0 |
| P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts | Oct 14, 2021 | Mixture-of-Experts, Natural Language Queries | Unverified | 0 |
| Simple or Complex? Complexity-Controllable Question Generation with Soft Templates and Deep Mixture of Experts Model | Oct 13, 2021 | Mixture-of-Experts, Question Generation | Unverified | 0 |