| MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems | May 16, 2025 | BenchmarkingMixture-of-Experts | —Unverified | 0 | 0 |
| MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production | May 16, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| A Survey of Generative Categories and Techniques in Multimodal Large Language Models | May 29, 2025 | Mixture-of-ExpertsSelf-Supervised Learning | —Unverified | 0 | 0 |
| 3D Gaussian Splatting Data Compression with Mixture of Priors | May 6, 2025 | 3DGSData Compression | —Unverified | 0 | 0 |
| 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Jan 28, 2025 | Instruction FollowingMixture-of-Experts | —Unverified | 0 | 0 |
| Accelerating Mixture-of-Experts Training with Adaptive Expert Replication | Apr 28, 2025 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| Accelerating MoE Model Inference with Expert Sharding | Mar 11, 2025 | DecoderGPU | —Unverified | 0 | 0 |
| Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts | Mar 11, 2024 | Mixture-of-ExpertsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Modular Action Concept Grounding in Semantic Video Prediction | Nov 23, 2020 | Action RecognitionMixture-of-Experts | —Unverified | 0 | 0 |
| AdaEnsemble: Learning Adaptively Sparse Structured Ensemble Network for Click-Through Rate Prediction | Jan 6, 2023 | Click-Through Rate PredictionMixture-of-Experts | —Unverified | 0 | 0 |