| Self-Routing Capsule Networks | Dec 1, 2019 | ClusteringMixture-of-Experts | CodeCode Available | 0 |
| ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling | Feb 25, 2024 | ChatbotDiversity | CodeCode Available | 0 |
| Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models | Feb 21, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE Inference | Dec 16, 2024 | CPUGPU | CodeCode Available | 0 |
| DA-MoE: Addressing Depth-Sensitivity in Graph-Level Analysis through Mixture of Experts | Nov 5, 2024 | Mixture-of-ExpertsSensitivity | CodeCode Available | 0 |
| Union of Experts: Adapting Hierarchical Routing to Equivalently Decomposed Transformer | Mar 4, 2025 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 0 |
| Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate | Jul 8, 2025 | Continual LearningMixture-of-Experts | CodeCode Available | 0 |
| Sequential Gaussian Processes for Online Learning of Nonstationary Functions | May 24, 2019 | Gaussian ProcessesHyperparameter Optimization | CodeCode Available | 0 |
| Self-Supervised Multimodal Domino: in Search of Biomarkers for Alzheimer's Disease | Dec 25, 2020 | Contrastive LearningDecoder | CodeCode Available | 0 |
| OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser | Jun 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |