| Double Deep Q-Learning in Opponent Modeling | Nov 24, 2022 | Mixture-of-ExpertsQ-Learning | —Unverified | 0 |
| Spatial Mixture-of-Experts | Nov 24, 2022 | Mixture-of-Experts | CodeCode Available | 1 |
| Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production | Nov 18, 2022 | Machine TranslationMixture-of-Experts | —Unverified | 0 |
| A Bird's-eye View of Reranking: from List Level to Page Level | Nov 17, 2022 | Mixture-of-ExpertsRecommendation Systems | CodeCode Available | 0 |
| HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization | Nov 15, 2022 | Domain GeneralizationMixture-of-Experts | —Unverified | 0 |
| Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts | Nov 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PAD-Net: An Efficient Framework for Dynamic Networks | Nov 10, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations | Nov 8, 2022 | Mixture-of-ExpertsSpeech-to-Speech Translation | —Unverified | 0 |
| Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC | Nov 7, 2022 | Data AugmentationMixture-of-Experts | —Unverified | 0 |
| Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts | Nov 3, 2022 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Contextual Mixture of Experts: Integrating Knowledge into Predictive Modeling | Nov 1, 2022 | Mixture-of-Experts | —Unverified | 0 |
| Prediction Sets for High-Dimensional Mixture of Experts Models | Oct 30, 2022 | Mixture-of-ExpertsPrediction | —Unverified | 0 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Oct 28, 2022 | Common Sense ReasoningCoreference Resolution | —Unverified | 0 |
| Coordination with Humans via Strategy Matching | Oct 27, 2022 | Mixture-of-Experts | —Unverified | 0 |
| M^3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design | Oct 26, 2022 | Mixture-of-ExpertsMulti-Task Learning | CodeCode Available | 1 |
| On the Adversarial Robustness of Mixture of Experts | Oct 19, 2022 | Adversarial RobustnessMixture-of-Experts | —Unverified | 0 |
| Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters | Oct 18, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation | Oct 14, 2022 | CPUMachine Translation | CodeCode Available | 1 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| FEAMOE: Fair, Explainable and Adaptive Mixture of Experts | Oct 10, 2022 | FairnessMixture-of-Experts | —Unverified | 0 |
| Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts | Oct 8, 2022 | Domain GeneralizationKnowledge Distillation | CodeCode Available | 1 |
| Deep Learning Mixture-of-Experts Approach for Cytotoxic Edema Assessment in Infants and Children | Oct 6, 2022 | image-classificationImage Classification | —Unverified | 0 |
| Probabilistic partition of unity networks for high-dimensional regression problems | Oct 6, 2022 | Dimensionality ReductionMixture-of-Experts | —Unverified | 0 |
| Table-based Fact Verification with Self-labeled Keypoint Alignment | Oct 1, 2022 | AttributeContrastive Learning | —Unverified | 0 |
| Parameter-varying neural ordinary differential equations with partition-of-unity networks | Oct 1, 2022 | Mixture-of-ExpertsUnity | —Unverified | 0 |
| Sparsity-Constrained Optimal Transport | Sep 30, 2022 | Mixture-of-Experts | —Unverified | 0 |
| Mixture of experts models for multilevel data: modelling framework and approximation theory | Sep 30, 2022 | Mixture-of-Expertsregression | —Unverified | 0 |
| Tuning of Mixture-of-Experts Mixed-Precision Neural Networks | Sep 29, 2022 | image-classificationImage Classification | —Unverified | 0 |
| Diversified Dynamic Routing for Vision Tasks | Sep 26, 2022 | Instance SegmentationMixture-of-Experts | —Unverified | 0 |
| Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition | Sep 17, 2022 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 |
| Sparse Video Representation Using Steered Mixture-of-Experts With Global Motion Compensation | Sep 13, 2022 | Mixture-of-ExpertsMotion Compensation | —Unverified | 0 |
| A Review of Sparse Expert Models in Deep Learning | Sep 4, 2022 | Deep LearningMixture-of-Experts | —Unverified | 0 |
| ADMoE: Anomaly Detection with Mixture-of-Experts from Noisy Labels | Aug 24, 2022 | Anomaly DetectionMixture-of-Experts | —Unverified | 0 |
| Mask and Reason: Pre-Training Knowledge Graph Transformers for Complex Logical Queries | Aug 16, 2022 | Mixture-of-Experts | CodeCode Available | 1 |
| Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation | Aug 15, 2022 | DiversityGraph Generation | —Unverified | 0 |
| A Theoretical View on Sparsely Activated Networks | Aug 8, 2022 | Mixture-of-Experts | —Unverified | 0 |
| Towards Understanding Mixture of Experts in Deep Learning | Aug 4, 2022 | Deep LearningMixture-of-Experts | CodeCode Available | 1 |
| Edge-Aware Autoencoder Design for Real-Time Mixture-of-Experts Image Compression | Jul 25, 2022 | DenoisingImage Compression | —Unverified | 0 |
| Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts | Jul 24, 2022 | Deep Reinforcement LearningHumanoid Control | CodeCode Available | 1 |
| Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing | Jul 20, 2022 | Domain GeneralizationFace Anti-Spoofing | —Unverified | 0 |
| MoEC: Mixture of Expert Clusters | Jul 19, 2022 | Machine TranslationMixture-of-Experts | —Unverified | 0 |
| Learning Large-scale Universal User Representation with Sparse Mixture of Experts | Jul 11, 2022 | Mixture-of-Experts | —Unverified | 0 |
| No Language Left Behind: Scaling Human-Centered Machine Translation | Jul 11, 2022 | Machine TranslationMixture-of-Experts | CodeCode Available | 2 |
| DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale | Jun 30, 2022 | CPUGPU | CodeCode Available | 4 |
| RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval | Jun 26, 2022 | Mixture-of-ExpertsRetrieval | CodeCode Available | 0 |
| Scalable Neural Data Server: A Data Recommender for Transfer Learning | Jun 19, 2022 | Mixture-of-ExpertsTransfer Learning | —Unverified | 0 |
| Adaptive Expert Models for Personalization in Federated Learning | Jun 15, 2022 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| Towards Universal Sequence Representation Learning for Recommender Systems | Jun 13, 2022 | Mixture-of-ExpertsRecommendation Systems | CodeCode Available | 2 |
| Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs | Jun 9, 2022 | Image CaptioningImage Classification | CodeCode Available | 2 |
| Sparse Mixture-of-Experts are Domain Generalizable Learners | Jun 8, 2022 | Domain GeneralizationMixture-of-Experts | CodeCode Available | 1 |