| Learning Deep Mixtures of Gaussian Process Experts Using Sum-Product Networks | Sep 12, 2018 | Gaussian ProcessesMixture-of-Experts | CodeCode Available | 0 | 5 |
| Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators | Aug 22, 2024 | HallucinationMixture-of-Experts | CodeCode Available | 0 | 5 |
| Adaptive Expert Models for Personalization in Federated Learning | Jun 15, 2022 | Federated LearningMixture-of-Experts | CodeCode Available | 0 | 5 |
| k-Winners-Take-All Ensemble Neural Network | Jan 4, 2024 | AllMixture-of-Experts | CodeCode Available | 0 | 5 |
| Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts | Jun 26, 2025 | Mixture-of-Experts | CodeCode Available | 0 | 5 |
| Learning Gating ConvNet for Two-Stream based Methods in Action Recognition | Sep 12, 2017 | Action ClassificationAction Recognition | CodeCode Available | 0 | 5 |
| Lifelong Mixture of Variational Autoencoders | Jul 9, 2021 | Lifelong learningMixture-of-Experts | CodeCode Available | 0 | 5 |
| Mixture Content Selection for Diverse Sequence Generation | Sep 4, 2019 | Abstractive Text SummarizationDecoder | CodeCode Available | 0 | 5 |
| RouterKT: Mixture-of-Experts for Knowledge Tracing | Apr 11, 2025 | Knowledge TracingMixture-of-Experts | CodeCode Available | 0 | 5 |
| Improved Training of Mixture-of-Experts Language GANs | Feb 23, 2023 | Adversarial TextImage Generation | —Unverified | 0 | 0 |
| Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach | Nov 12, 2024 | Autonomous DrivingImitation Learning | —Unverified | 0 | 0 |
| Denoising OCT Images Using Steered Mixture of Experts with Multi-Model Inference | Feb 20, 2024 | DenoisingDiagnostic | —Unverified | 0 | 0 |
| Imitation Learning from MPC for Quadrupedal Multi-Gait Control | Mar 26, 2021 | Imitation LearningMixture-of-Experts | —Unverified | 0 | 0 |
| iMedImage Technical Report | Mar 27, 2025 | Anomaly DetectionDiagnostic | —Unverified | 0 | 0 |
| Automatic Document Sketching: Generating Drafts from Analogous Texts | Jun 14, 2021 | Mixture-of-ExpertsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Identifying Shopping Intent in Product QA for Proactive Recommendations | Apr 9, 2024 | FrictionMixture-of-Experts | —Unverified | 0 | 0 |
| Demystifying Softmax Gating Function in Gaussian Mixture of Experts | May 5, 2023 | Mixture-of-Expertsparameter estimation | —Unverified | 0 | 0 |
| IDEA: An Inverse Domain Expert Adaptation Based Active DNN IP Protection Method | Sep 29, 2024 | Domain AdaptationMixture-of-Experts | —Unverified | 0 | 0 |
| Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models | Jan 21, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Automatically Extracting Information in Medical Dialogue: Expert System And Attention for Labelling | Nov 28, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks | Oct 31, 2018 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Hypertext Entity Extraction in Webpage | Mar 4, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| HydraSum - Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Sep 29, 2021 | Abstractive Text SummarizationDecoder | —Unverified | 0 | 0 |
| Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | May 21, 2025 | ChatbotInstruction Following | —Unverified | 0 | 0 |
| A Universal Approximation Theorem for Mixture of Experts Models | Feb 11, 2016 | General ClassificationMixture-of-Experts | —Unverified | 0 | 0 |
| AMEND: A Mixture of Experts Framework for Long-tailed Trajectory Prediction | Feb 13, 2024 | Contrastive LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network | Apr 10, 2025 | Mixture-of-Expertsobject-detection | —Unverified | 0 | 0 |
| How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines | Feb 17, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| How Lightweight Can A Vision Transformer Be | Jul 25, 2024 | Mixture-of-ExpertsTransfer Learning | —Unverified | 0 | 0 |
| How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | Mar 4, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 | 0 |
| A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System | Apr 1, 2025 | Dialogue GenerationEnsemble Learning | —Unverified | 0 | 0 |
| How Do Consumers Really Choose: Exposing Hidden Preferences with the Mixture of Experts Model | Mar 3, 2025 | Decision MakingDemand Forecasting | —Unverified | 0 | 0 |
| How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? | May 1, 2022 | Entity TypingMixture-of-Experts | —Unverified | 0 | 0 |
| HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts | Nov 23, 2023 | Compositional Zero-Shot LearningMixture-of-Experts | —Unverified | 0 | 0 |
| HoME: Hierarchy of Multi-Gate Experts for Multi-Task Learning at Kuaishou | Aug 10, 2024 | Mixture-of-ExpertsMulti-Task Learning | —Unverified | 0 | 0 |
| A Unified Framework for Iris Anti-Spoofing: Introducing IrisGeneral Dataset and Masked-MoE Method | Aug 19, 2024 | Iris RecognitionMixture-of-Experts | —Unverified | 0 | 0 |
| Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models | Apr 9, 2025 | Instruction FollowingMathematical Problem-Solving | —Unverified | 0 | 0 |
| HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference | Nov 3, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization | Nov 15, 2022 | Domain GeneralizationMixture-of-Experts | —Unverified | 0 | 0 |
| HMoE: Heterogeneous Mixture of Experts for Language Modeling | Aug 20, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 | 0 |
| A Unified Approach to Universal Prediction: Generalized Upper and Lower Bounds | Nov 25, 2013 | Learning TheoryMixture-of-Experts | —Unverified | 0 | 0 |
| HiMoE: Heterogeneity-Informed Mixture-of-Experts for Fair Spatial-Temporal Forecasting | Nov 30, 2024 | FairnessMixture-of-Experts | —Unverified | 0 | 0 |
| Hierarchical Routing Mixture of Experts | Mar 18, 2019 | Mixture-of-Expertsregression | —Unverified | 0 | 0 |
| Deep Learning Mixture-of-Experts Approach for Cytotoxic Edema Assessment in Infants and Children | Oct 6, 2022 | image-classificationImage Classification | —Unverified | 0 | 0 |
| A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling | Jun 9, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Alternating Updates for Efficient Transformers | Jan 30, 2023 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Conditional Expert Selection Network for Multi-domain Recommendation | Nov 11, 2024 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| Accelerating Mixture-of-Experts Training with Adaptive Expert Replication | Apr 28, 2025 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| Hierarchical Mixture-of-Experts Model for Large-Scale Gaussian Process Regression | Dec 9, 2014 | Mixture-of-Expertsregression | —Unverified | 0 | 0 |
| Deep Gaussian Covariance Network | Oct 17, 2017 | Gaussian ProcessesMixture-of-Experts | —Unverified | 0 | 0 |