| SMAR: Soft Modality-Aware Routing Strategy for MoE-based Multimodal Large Language Models Preserving Language Capabilities | Jun 6, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing | Dec 10, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning | Jul 1, 2024 | Continual LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | Jan 16, 2022 | Mixture-of-ExpertsMulti-Task Learning | —Unverified | 0 | 0 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | Apr 16, 2022 | Mixture-of-ExpertsMulti-Task Learning | —Unverified | 0 | 0 |
| Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT | May 24, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Sparse Mixture of Experts as Unified Competitive Learning | Mar 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Sparse Mixture-of-Experts for Non-Uniform Noise Reduction in MRI Images | Jan 24, 2025 | DenoisingDiagnostic | —Unverified | 0 | 0 |
| Cross-token Modeling with Conditional Computation | Sep 5, 2021 | Computational EfficiencyImage Classification | —Unverified | 0 | 0 |
| Sparse Upcycling: Inference Inefficient Finetuning | Nov 13, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Sparse Video Representation Using Steered Mixture-of-Experts With Global Motion Compensation | Sep 13, 2022 | Mixture-of-ExpertsMotion Compensation | —Unverified | 0 | 0 |
| Sparsity-Constrained Optimal Transport | Sep 30, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling | Mar 6, 2025 | Mixture-of-ExpertsScheduling | —Unverified | 0 | 0 |
| SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations | Nov 8, 2022 | Mixture-of-ExpertsSpeech-to-Speech Translation | —Unverified | 0 | 0 |
| SpeechMoE2: Mixture-of-Experts Model with Improved Routing | Nov 23, 2021 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | Jul 8, 2025 | Data AugmentationMixture-of-Experts | —Unverified | 0 | 0 |
| SPMoE: Generate Multiple Pattern-Aware Outputs with Sparse Pattern Mixture of Experts | Aug 17, 2021 | DiversityMixture-of-Experts | —Unverified | 0 | 0 |
| SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging | Aug 22, 2024 | DiversityMixture-of-Experts | —Unverified | 0 | 0 |
| StableMoE: Stable Routing Strategy for Mixture of Experts | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| STAR-Rec: Making Peace with Length Variance and Pattern Diversity in Sequential Recommendation | May 6, 2025 | DiversityMixture-of-Experts | —Unverified | 0 | 0 |
| Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference | Jan 27, 2025 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| Statistical Advantages of Perturbing Cosine Router in Mixture of Experts | May 23, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts | Sep 25, 2023 | Density EstimationMixture-of-Experts | —Unverified | 0 | 0 |
| Stealing User Prompts from Mixture of Experts | Oct 30, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Steered Mixture-of-Experts Autoencoder Design for Real-Time Image Modelling and Denoising | May 5, 2023 | DecoderDenoising | —Unverified | 0 | 0 |