| Self-Routing Capsule Networks | Dec 1, 2019 | ClusteringMixture-of-Experts | CodeCode Available | 0 |
| ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling | Feb 25, 2024 | ChatbotDiversity | CodeCode Available | 0 |
| Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models | Feb 21, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference | Dec 16, 2024 | CPUGPU | CodeCode Available | 0 |
| DA-MoE: Addressing Depth-Sensitivity in Graph-Level Analysis through Mixture of Experts | Nov 5, 2024 | Mixture-of-ExpertsSensitivity | CodeCode Available | 0 |
| Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer | Mar 4, 2025 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 0 |
| Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate | Jul 8, 2025 | Continual LearningMixture-of-Experts | CodeCode Available | 0 |
| Sequential Gaussian Processes for Online Learning of Nonstationary Functions | May 24, 2019 | Gaussian ProcessesHyperparameter Optimization | CodeCode Available | 0 |
| Self-Supervised Multimodal Domino: in Search of Biomarkers for Alzheimer's Disease | Dec 25, 2020 | Contrastive LearningDecoder | CodeCode Available | 0 |
| OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser | Jun 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Mixture of Experts Meets Decoupled Message Passing: Towards General and Adaptive Node Classification | Dec 11, 2024 | Computational Efficiency | CodeCode Available | 0 |
| Video Relationship Detection Using Mixture of Experts | Mar 6, 2024 | Action RecognitionMixture-of-Experts | CodeCode Available | 0 |
| Graph Knowledge Distillation to Mixture of Experts | Jun 17, 2024 | Knowledge DistillationMixture-of-Experts | CodeCode Available | 0 |
| Tensor-variate Mixture of Experts for Proportional Myographic Control of a Robotic Hand | Feb 28, 2019 | Mixture-of-Expertsregression | CodeCode Available | 0 |
| Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection | Jan 6, 2025 | Decision MakingMixture-of-Experts | CodeCode Available | 0 |
| Granger-causal Attentive Mixtures of Experts: Learning Important Features with Neural Networks | Feb 6, 2018 | Feature ImportanceMixture-of-Experts | CodeCode Available | 0 |
| Adversarial Mixture Of Experts with Category Hierarchy Soft Constraint | Jul 24, 2020 | ClusteringFeature Importance | CodeCode Available | 0 |
| A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models | Apr 6, 2021 | Mixture-of-ExpertsModel Selection | CodeCode Available | 0 |
| Covariate-guided Bayesian mixture model for multivariate time series | Jan 3, 2023 | Mixture-of-ExpertsTime Series | CodeCode Available | 0 |
| Mixture Content Selection for Diverse Sequence Generation | Sep 4, 2019 | Abstractive Text SummarizationDecoder | CodeCode Available | 0 |
| Countering Mainstream Bias via End-to-End Adaptive Local Learning | Apr 13, 2024 | Collaborative FilteringMixture-of-Experts | CodeCode Available | 0 |
| Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts | Feb 23, 2024 | Mixture-of-Experts | CodeCode Available | 0 |
| MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation | Apr 29, 2025 | cross-modal alignmentDecoder | CodeCode Available | 0 |
| MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts | Jul 13, 2024 | DiversityMixture-of-Experts | CodeCode Available | 0 |
| Peirce in the Machine: How Mixture of Experts Models Perform Hypothesis Construction | Jun 24, 2024 | Mixture-of-Experts | CodeCode Available | 0 |
| Condensing Multilingual Knowledge with Lightweight Language-Specific Modules | May 23, 2023 | Machine TranslationMixture-of-Experts | CodeCode Available | 0 |
| Completed Feature Disentanglement Learning for Multimodal MRIs Analysis | Jul 6, 2024 | DisentanglementMixture-of-Experts | CodeCode Available | 0 |
| Skeleton-Based Human Action Recognition with Noisy Labels | Mar 15, 2024 | Action RecognitionDenoising | CodeCode Available | 0 |
| UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity | Dec 28, 2024 | Image RestorationMixture-of-Experts | CodeCode Available | 0 |
| Manifold-Preserving Transformers are Effective for Short-Long Range Encoding | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving | Jul 19, 2025 | Autonomous DrivingBench2Drive | CodeCode Available | 0 |
| FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models | Oct 3, 2023 | Face TransferMixture-of-Experts | CodeCode Available | 0 |
| From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents | May 29, 2025 | AI AgentMixture-of-Experts | CodeCode Available | 0 |
| A Gaussian Process-based Streaming Algorithm for Prediction of Time Series With Regimes and Outliers | Jun 1, 2024 | Gaussian ProcessesMixture-of-Experts | CodeCode Available | 0 |
| Anomaly Detection by Recombining Gated Unsupervised Experts | Aug 31, 2020 | Anomaly DetectionMixture-of-Experts | CodeCode Available | 0 |
| SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Dec 17, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Finger Pose Estimation for Under-screen Fingerprint Sensor | May 5, 2025 | Mixture-of-ExpertsPose Estimation | CodeCode Available | 0 |
| pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning | Feb 2, 2024 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Aug 17, 2024 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers | Feb 26, 2024 | Knowledge DistillationMixture-of-Experts | CodeCode Available | 0 |
| BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing | Dec 24, 2024 | Decision MakingFace Anti-Spoofing | CodeCode Available | 0 |
| Fast filtering of non-Gaussian models using Amortized Optimal Transport Maps | Mar 16, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| A Gated Residual Kolmogorov-Arnold Networks for Mixtures of Experts | Sep 23, 2024 | Kolmogorov-Arnold NetworksMixture-of-Experts | CodeCode Available | 0 |
| Bidirectional Attention as a Mixture of Continuous Word Experts | Jul 8, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 0 |
| Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy | Sep 11, 2021 | Machine TranslationMixture-of-Experts | CodeCode Available | 0 |
| Tight Clusters Make Specialized Experts | Feb 21, 2025 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition | May 19, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors | Apr 2, 2024 | Data PoisoningHate Speech Detection | CodeCode Available | 0 |
| LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? | May 7, 2025 | Large Language ModelMixture-of-Experts | CodeCode Available | 0 |
| FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models | Aug 15, 2024 | Mixture-of-Experts | CodeCode Available | 0 |