| Alternating Updates for Efficient Transformers | Jan 30, 2023 | Mixture-of-Experts | —Unverified | 0 |
| Adaptive Conditional Expert Selection Network for Multi-domain Recommendation | Nov 11, 2024 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 |
| FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement | Apr 8, 2023 | Mixture-of-ExpertsScheduling | —Unverified | 0 |
| Deep Gaussian Covariance Network | Oct 17, 2017 | Gaussian ProcessesMixture-of-Experts | —Unverified | 0 |
| Attention Weighted Mixture of Experts with Contrastive Learning for Personalized Ranking in E-commerce | Jun 8, 2023 | Contrastive LearningMixture-of-Experts | —Unverified | 0 |
| Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis | May 30, 2025 | BlockingMixture-of-Experts | —Unverified | 0 |
| Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection | May 25, 2021 | Data AugmentationDecoder | —Unverified | 0 |
| A Tree Architecture of LSTM Networks for Sequential Regression with Missing Data | May 22, 2020 | Mixture-of-Expertsregression | —Unverified | 0 |
| Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | May 10, 2023 | Classificationimage-classification | —Unverified | 0 |
| DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models | Sep 10, 2024 | Mixture-of-Experts | —Unverified | 0 |