| Quantitative Stock Investment by Routing Uncertainty-Aware Trading Experts: A Multi-Task Learning Approach | Jun 7, 2022 | Decision Making, Mixture-of-Experts | Unverified | 0 |
| Tutel: Adaptive Mixture-of-Experts at Scale | Jun 7, 2022 | Mixture-of-Experts, Object Detection | Code Available | 2 |
| Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts | Jun 6, 2022 | Contrastive Learning, Mixture-of-Experts | Unverified | 0 |
| Interpretable Mixture of Experts | Jun 5, 2022 | Decision Making, Mixture-of-Experts | Unverified | 0 |
| Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation | Jun 3, 2022 | Decoder, Image Segmentation | Code Available | 1 |
| Task-Specific Expert Pruning for Sparse Mixture-of-Experts | Jun 1, 2022 | Mixture-of-Experts | Unverified | 0 |
| Text2Human: Text-Driven Controllable Human Image Generation | May 31, 2022 | Diversity, Human Parsing | Code Available | 2 |
| Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers | May 28, 2022 | Machine Translation, Mixture-of-Experts | Unverified | 0 |
| Automatic Expert Selection for Multi-Scenario and Multi-Task Search | May 28, 2022 | Mixture-of-Experts, Multi-Task Learning | Unverified | 0 |
| Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts | May 25, 2022 | Mixture-of-Experts, Multi-Task Learning | Code Available | 0 |
| Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT | May 24, 2022 | Mixture-of-Experts | Unverified | 0 |
| MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services | May 20, 2022 | CPU, Distributed Computing | Unverified | 0 |
| Pluralistic Image Completion with Probabilistic Mixture-of-Experts | May 18, 2022 | Diversity, Mixture-of-Experts | Unverified | 0 |
| Unified Modeling of Multi-Domain Multi-Device ASR Systems | May 13, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Addressing Confounding Feature Issue for Causal Recommendation | May 13, 2022 | Mixture-of-Experts, Recommendation Systems | Code Available | 1 |
| ST-ExpertNet: A Deep Expert Framework for Traffic Prediction | May 5, 2022 | Mixture-of-Experts, Prediction | Unverified | 0 |
| Optimizing Mixture of Experts using Dynamic Recompilations | May 4, 2022 | Mixture-of-Experts | Unverified | 0 |
| How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? | May 1, 2022 | Entity Typing, Mixture-of-Experts | Unverified | 0 |
| On the Representation Collapse of Sparse Mixture of Experts | Apr 20, 2022 | Clustering, Language Modeling | Unverified | 0 |
| Residual Mixture of Experts | Apr 20, 2022 | Mixture-of-Experts, object-detection | Unverified | 0 |
| Table-based Fact Verification with Self-adaptive Mixture of Experts | Apr 19, 2022 | Fact Verification, Logical Reasoning | Code Available | 0 |
| Towards Efficient Single Image Dehazing and Desnowing | Apr 19, 2022 | Image Dehazing, Image Restoration | Unverified | 0 |
| StableMoE: Stable Routing Strategy for Mixture of Experts | Apr 18, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | Apr 16, 2022 | Mixture-of-Experts, Multi-Task Learning | Unverified | 0 |
| Mixture of Experts for Biomedical Question Answering | Apr 15, 2022 | Mixture-of-Experts, Question Answering | Unverified | 0 |
| MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation | Apr 15, 2022 | Knowledge Distillation, Mixture-of-Experts | Code Available | 1 |
| Mixture-of-experts VAEs can disregard variation in surjective multimodal data | Apr 11, 2022 | Mixture-of-Experts | Unverified | 0 |
| 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition | Apr 7, 2022 | Mixture-of-Experts, speech-recognition | Code Available | 1 |
| Learning to Adapt Clinical Sequences with Residual Mixture of Experts | Apr 6, 2022 | Mixture-of-Experts | Code Available | 0 |
| Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation | Apr 5, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| On the Adaptation to Concept Drift for CTR Prediction | Apr 1, 2022 | Click-Through Rate Prediction, Incremental Learning | Unverified | 0 |
| Efficient Reflectance Capture with a Deep Gated Mixture-of-Experts | Mar 29, 2022 | Decoder, Mixture-of-Experts | Unverified | 0 |
| Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution | Mar 27, 2022 | Image Super-Resolution, Mixture-of-Experts | Code Available | 1 |
| Build a Robust QA System with Transformer-based Mixture of Experts | Mar 20, 2022 | Data Augmentation, Mixture-of-Experts | Code Available | 0 |
| Efficient Language Modeling with Sparse all-MLP | Mar 14, 2022 | All, Common Sense Reasoning | Unverified | 0 |
| SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | Mar 13, 2022 | Abstractive Text Summarization, Document Summarization | Code Available | 1 |
| SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding | Mar 7, 2022 | Language Modelling, Masked Language Modeling | Unverified | 0 |
| Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models | Mar 2, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Functional mixture-of-experts for classification | Feb 28, 2022 | Classification, Mixture-of-Experts | Unverified | 0 |
| Mixture-of-Experts with Expert Choice Routing | Feb 18, 2022 | Mixture-of-Experts | Unverified | 0 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | Feb 17, 2022 | ARC, Common Sense Reasoning | Code Available | 3 |
| A Survey on Dynamic Neural Networks for Natural Language Processing | Feb 15, 2022 | Dynamic neural networks, Mixture-of-Experts | Unverified | 0 |
| Physics-Guided Problem Decomposition for Scaling Deep Learning of High-dimensional Eigen-Solvers: The Case of Schrödinger's Equation | Feb 12, 2022 | Mixture-of-Experts, Problem Decomposition | Unverified | 0 |
| One Student Knows All Experts Know: From Sparse to Dense | Jan 26, 2022 | All, Knowledge Distillation | Unverified | 0 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | Jan 16, 2022 | Mixture-of-Experts, Multi-Task Learning | Unverified | 0 |
| MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation | Jan 16, 2022 | Knowledge Distillation, Mixture-of-Experts | Unverified | 0 |
| DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale | Jan 14, 2022 | Decoder, Mixture-of-Experts | Code Available | 0 |
| Towards Lightweight Neural Animation : Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models | Jan 11, 2022 | Mixture-of-Experts, Network Pruning | Unverified | 0 |
| MDFEND: Multi-domain Fake News Detection | Jan 4, 2022 | Fake News Detection, Mixture-of-Experts | Code Available | 2 |
| EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Dec 29, 2021 | Language Modeling, Language Modelling | Code Available | 1 |