| Double Deep Q-Learning in Opponent Modeling | Nov 24, 2022 | Mixture-of-Experts, Q-Learning | Unverified | 0 |
| Spatial Mixture-of-Experts | Nov 24, 2022 | Mixture-of-Experts | Code Available | 1 |
| Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production | Nov 18, 2022 | Machine Translation, Mixture-of-Experts | Unverified | 0 |
| A Bird's-eye View of Reranking: from List Level to Page Level | Nov 17, 2022 | Mixture-of-Experts, Recommendation Systems | Code Available | 0 |
| HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization | Nov 15, 2022 | Domain Generalization, Mixture-of-Experts | Unverified | 0 |
| Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts | Nov 11, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PAD-Net: An Efficient Framework for Dynamic Networks | Nov 10, 2022 | image-classification, Image Classification | Code Available | 1 |
| SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations | Nov 8, 2022 | Mixture-of-Experts, Speech-to-Speech Translation | Unverified | 0 |
| Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC | Nov 7, 2022 | Data Augmentation, Mixture-of-Experts | Unverified | 0 |
| Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts | Nov 3, 2022 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Contextual Mixture of Experts: Integrating Knowledge into Predictive Modeling | Nov 1, 2022 | Mixture-of-Experts | Unverified | 0 |
| Prediction Sets for High-Dimensional Mixture of Experts Models | Oct 30, 2022 | Mixture-of-Experts, Prediction | Unverified | 0 |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Oct 28, 2022 | Common Sense Reasoning, Coreference Resolution | Unverified | 0 |
| Coordination with Humans via Strategy Matching | Oct 27, 2022 | Mixture-of-Experts | Unverified | 0 |
| M^3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design | Oct 26, 2022 | Mixture-of-Experts, Multi-Task Learning | Code Available | 1 |
| On the Adversarial Robustness of Mixture of Experts | Oct 19, 2022 | Adversarial Robustness, Mixture-of-Experts | Unverified | 0 |
| Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters | Oct 18, 2022 | Language Modeling, Language Modelling | Unverified | 0 |
| AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation | Oct 14, 2022 | CPU, Machine Translation | Code Available | 1 |
| Mixture of Attention Heads: Selecting Attention Heads Per Token | Oct 11, 2022 | Computational Efficiency, Language Modeling | Code Available | 1 |
| FEAMOE: Fair, Explainable and Adaptive Mixture of Experts | Oct 10, 2022 | Fairness, Mixture-of-Experts | Unverified | 0 |
| Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts | Oct 8, 2022 | Domain Generalization, Knowledge Distillation | Code Available | 1 |
| Deep Learning Mixture-of-Experts Approach for Cytotoxic Edema Assessment in Infants and Children | Oct 6, 2022 | image-classification, Image Classification | Unverified | 0 |
| Probabilistic partition of unity networks for high-dimensional regression problems | Oct 6, 2022 | Dimensionality Reduction, Mixture-of-Experts | Unverified | 0 |
| Table-based Fact Verification with Self-labeled Keypoint Alignment | Oct 1, 2022 | Attribute, Contrastive Learning | Unverified | 0 |
| Parameter-varying neural ordinary differential equations with partition-of-unity networks | Oct 1, 2022 | Mixture-of-Experts, Unity | Unverified | 0 |