| Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing | Jan 9, 2025 | Bayesian OptimizationCPU | —Unverified | 0 |
| Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques | May 5, 2025 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 |
| Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling | Oct 22, 2024 | AllGPU | —Unverified | 0 |
| Optimizing Mixture of Experts using Dynamic Recompilations | May 4, 2022 | Mixture-of-Experts | —Unverified | 0 |
| Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach | Feb 5, 2025 | Adversarial RobustnessMixture-of-Experts | —Unverified | 0 |
| Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts | Jun 12, 2025 | DiversityMinecraft | —Unverified | 0 |
| P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts | Oct 14, 2021 | Mixture-of-ExpertsNatural Language Queries | —Unverified | 0 |
| Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs | May 7, 2025 | Mixture-of-Experts | —Unverified | 0 |
| Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition | Sep 17, 2022 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 |
| Parameter-Efficient Quantized Mixture-of-Experts Meets Vision-Language Instruction Tuning for Semiconductor Electron Micrograph Analysis | Aug 27, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |