| A Mixture of Experts Approach to 3D Human Motion Prediction | May 9, 2024 | Human motion prediction, Mixture-of-Experts | Code Available | 0 | 5 |
| k-Winners-Take-All Ensemble Neural Network | Jan 4, 2024 | All, Mixture-of-Experts | Code Available | 0 | 5 |
| Information Maximizing Curriculum: A Curriculum-Based Approach for Imitating Diverse Skills | Mar 27, 2023 | Imitation Learning, Mixture-of-Experts | Code Available | 0 | 5 |
| Jamba: A Hybrid Transformer-Mamba Language Model | Mar 28, 2024 | GPU, Language Modeling | Code Available | 0 | 5 |
| Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators | Aug 22, 2024 | Hallucination, Mixture-of-Experts | Code Available | 0 | 5 |
| Intrinsic User-Centric Interpretability through Global Mixture of Experts | Feb 5, 2024 | Mixture-of-Experts, News Classification | Code Available | 0 | 5 |
| Adaptive Expert Models for Personalization in Federated Learning | Jun 15, 2022 | Federated Learning, Mixture-of-Experts | Code Available | 0 | 5 |
| Learning CHARME models with neural networks | Feb 8, 2020 | Learning Theory, Mixture-of-Experts | Code Available | 0 | 5 |
| Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Jul 28, 2024 | Knowledge Distillation, Mixture-of-Experts | Code Available | 0 | 5 |
| Improved Training of Mixture-of-Experts Language GANs | Feb 23, 2023 | Adversarial Text, Image Generation | Unverified | 0 | 0 |
| Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach | Nov 12, 2024 | Autonomous Driving, Imitation Learning | Unverified | 0 | 0 |
| Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference | Feb 20, 2024 | Denoising, Diagnostic | Unverified | 0 | 0 |
| Imitation Learning from MPC for Quadrupedal Multi-Gait Control | Mar 26, 2021 | Imitation Learning, Mixture-of-Experts | Unverified | 0 | 0 |
| iMedImage Technical Report | Mar 27, 2025 | Anomaly Detection, Diagnostic | Unverified | 0 | 0 |
| Automatic Document Sketching: Generating Drafts from Analogous Texts | Jun 14, 2021 | Mixture-of-Experts, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Identifying Shopping Intent in Product QA for Proactive Recommendations | Apr 9, 2024 | Friction, Mixture-of-Experts | Unverified | 0 | 0 |
| Demystifying Softmax Gating Function in Gaussian Mixture of Experts | May 5, 2023 | Mixture-of-Experts, parameter estimation | Unverified | 0 | 0 |
| IDEA: An Inverse Domain Expert Adaptation Based Active DNN IP Protection Method | Sep 29, 2024 | Domain Adaptation, Mixture-of-Experts | Unverified | 0 | 0 |
| Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models | Jan 21, 2025 | Mixture-of-Experts | Unverified | 0 | 0 |
| Automatically Extracting Information in Medical Dialogue: Expert System And Attention for Labelling | Nov 28, 2022 | Mixture-of-Experts | Unverified | 0 | 0 |
| A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks | Oct 31, 2018 | Mixture-of-Experts | Unverified | 0 | 0 |
| Hypertext Entity Extraction in Webpage | Mar 4, 2024 | Mixture-of-Experts | Unverified | 0 | 0 |
| HydraSum - Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Sep 29, 2021 | Abstractive Text Summarization, Decoder | Unverified | 0 | 0 |
| Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | May 21, 2025 | Chatbot, Instruction Following | Unverified | 0 | 0 |
| A Universal Approximation Theorem for Mixture of Experts Models | Feb 11, 2016 | General Classification, Mixture-of-Experts | Unverified | 0 | 0 |