| Explainable data-driven modeling via mixture of experts: towards effective blending of grey and black-box models | Jan 30, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities | Mar 28, 2025 | Mixture-of-ExpertsText Generation | —Unverified | 0 | 0 |
| Exploring Domain Robust Lightweight Reward Models based on Router Mechanism | Jul 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Exploring Routing Strategies for Multilingual Mixture-of-Experts Models | Jan 1, 2021 | DecoderMixture-of-Experts | —Unverified | 0 | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | May 31, 2021 | Mixture-of-ExpertsPlaying the Game of 2048 | —Unverified | 0 | 0 |
| Exploring Speaker Diarization with Mixture of Experts | Jun 17, 2025 | Mixture-of-Expertsspeaker-diarization | —Unverified | 0 | 0 |
| Facet-Aware Multi-Head Mixture-of-Experts Model for Sequential Recommendation | Nov 3, 2024 | Mixture-of-ExpertsSequential Recommendation | —Unverified | 0 | 0 |
| Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective | Feb 2, 2023 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| Faster Language Models with Better Multi-Token Prediction Using Tensor Decomposition | Oct 23, 2024 | Code GenerationMixture-of-Experts | —Unverified | 0 | 0 |
| Faster MoE LLM Inference for Extremely Large Models | May 6, 2025 | Inference OptimizationMixture-of-Experts | —Unverified | 0 | 0 |