| Wolf: Captioning Everything with a World Summarization Framework | Jul 26, 2024 | Autonomous Driving, Mixture-of-Experts | —Unverified | 0 | 0 |
| Yi-Lightning Technical Report | Dec 2, 2024 | Chatbot, Large Language Model | —Unverified | 0 | 0 |
| PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning | Jul 31, 2024 | Continual Learning, General Knowledge | —Unverified | 0 | 0 |
| Zero-Resource Multilingual Model Transfer: Learning What to Share | Sep 27, 2018 | Cross-Lingual Transfer, Mixture-of-Experts | —Unverified | 0 | 0 |
| Multimodal Fusion and Coherence Modeling for Video Topic Segmentation | Aug 1, 2024 | Contrastive Learning, Mixture-of-Experts | —Unverified | 0 | 0 |
| HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction | Aug 2, 2024 | Click-Through Rate Prediction, Mixture-of-Experts | —Unverified | 0 | 0 |
| Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization | Aug 5, 2024 | Face Detection, Mixture-of-Experts | —Unverified | 0 | 0 |
| Routing in Sparsely-gated Language Models responds to Context | Sep 21, 2024 | Decoder, Mixture-of-Experts | —Unverified | 0 | 0 |
| On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating | May 16, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| A Fast Kernel-based Conditional Independence test with Application to Causal Discovery | May 16, 2025 | Causal Discovery, Causal Inference | —Unverified | 0 | 0 |
| MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems | May 16, 2025 | Benchmarking, Mixture-of-Experts | —Unverified | 0 | 0 |
| MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production | May 16, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| A Survey of Generative Categories and Techniques in Multimodal Large Language Models | May 29, 2025 | Mixture-of-Experts, Self-Supervised Learning | —Unverified | 0 | 0 |
| 3D Gaussian Splatting Data Compression with Mixture of Priors | May 6, 2025 | 3DGS, Data Compression | —Unverified | 0 | 0 |
| 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Jan 28, 2025 | Instruction Following, Mixture-of-Experts | —Unverified | 0 | 0 |
| Accelerating Mixture-of-Experts Training with Adaptive Expert Replication | Apr 28, 2025 | GPU, Mixture-of-Experts | —Unverified | 0 | 0 |
| Accelerating MoE Model Inference with Expert Sharding | Mar 11, 2025 | Decoder, GPU | —Unverified | 0 | 0 |
| Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts | Mar 11, 2024 | Mixture-of-Experts, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Modular Action Concept Grounding in Semantic Video Prediction | Nov 23, 2020 | Action Recognition, Mixture-of-Experts | —Unverified | 0 | 0 |
| AdaEnsemble: Learning Adaptively Sparse Structured Ensemble Network for Click-Through Rate Prediction | Jan 6, 2023 | Click-Through Rate Prediction, Mixture-of-Experts | —Unverified | 0 | 0 |
| Ada-K Routing: Boosting the Efficiency of MoE-based LLMs | Oct 14, 2024 | Computational Efficiency, Mixture-of-Experts | —Unverified | 0 | 0 |
| AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts | Jan 1, 2023 | Instance Segmentation, Mixture-of-Experts | —Unverified | 0 | 0 |
| Adapted-MoE: Mixture of Experts with Test-Time Adaption for Anomaly Detection | Sep 9, 2024 | Anomaly Detection, Mixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Conditional Expert Selection Network for Multi-domain Recommendation | Nov 11, 2024 | Computational Efficiency, Mixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network | Apr 10, 2025 | Mixture-of-Experts, object-detection | —Unverified | 0 | 0 |
| Adaptive Gating in Mixture-of-Experts based Language Models | Oct 11, 2023 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing | Jul 20, 2022 | Domain Generalization, Face Anti-Spoofing | —Unverified | 0 | 0 |
| Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection | Mar 15, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective | Dec 11, 2024 | Continual Relation Extraction, Mixture-of-Experts | —Unverified | 0 | 0 |
| Adaptive Prompt: Unlocking the Power of Visual Prompt Tuning | Jan 31, 2025 | Mixture-of-Experts, Visual Prompt Tuning | —Unverified | 0 | 0 |
| Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Sep 16, 2024 | Denoising, Mixture-of-Experts | —Unverified | 0 | 0 |
| AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style | Jul 6, 2021 | Decoder, Mixture-of-Experts | —Unverified | 0 | 0 |
| AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding | Jun 4, 2021 | Attribute, Attribute Extraction | —Unverified | 0 | 0 |
| Addressing Complex and Subjective Product-Related Queries with Customer Reviews | Dec 21, 2015 | Mixture-of-Experts | —Unverified | 0 | 0 |
| ADMoE: Anomaly Detection with Mixture-of-Experts from Noisy Labels | Aug 24, 2022 | Anomaly Detection, Mixture-of-Experts | —Unverified | 0 | 0 |
| Advancing Enterprise Spatio-Temporal Forecasting Applications: Data Mining Meets Instruction Tuning of Language Models For Multi-modal Time Series Analysis in Low-Resource Settings | Aug 24, 2024 | Decision Making, Mixture-of-Experts | —Unverified | 0 | 0 |
| Advancing Expert Specialization for Better MoE | May 28, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design | Apr 2, 2025 | Attribute, Mixture-of-Experts | —Unverified | 0 | 0 |
| Advancing Robust Underwater Acoustic Target Recognition through Multi-task Learning and Multi-Gate Mixture-of-Experts | Nov 5, 2024 | Mixture-of-Experts, Multi-Task Learning | —Unverified | 0 | 0 |
| A Dynamic Approach to Stock Price Prediction: Comparing RNN and Mixture of Experts Models Across Different Volatility Profiles | Oct 4, 2024 | Mixture-of-Experts, Stock Price Prediction | —Unverified | 0 | 0 |
| Affect in Tweets Using Experts Model | Mar 20, 2019 | Mixture-of-Experts, model | —Unverified | 0 | 0 |
| A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery | Mar 6, 2025 | Denoising, Drug Discovery | —Unverified | 0 | 0 |
| A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts | Oct 22, 2023 | Density Estimation, Mixture-of-Experts | —Unverified | 0 | 0 |
| Agent4Ranking: Semantic Robust Ranking via Personalized Query Rewriting Using Multi-agent LLM | Dec 24, 2023 | Mixture-of-Experts | —Unverified | 0 | 0 |
| AIREX: Neural Network-based Approach for Air Quality Inference in Unmonitored Cities | Aug 16, 2021 | Air Quality Inference, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Large-scale Medical Visual Task Adaptation Benchmark | Apr 19, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | May 10, 2023 | Classification, image-classification | —Unverified | 0 | 0 |
| Alternating Updates for Efficient Transformers | Jan 30, 2023 | Mixture-of-Experts | —Unverified | 0 | 0 |
| AMEND: A Mixture of Experts Framework for Long-tailed Trajectory Prediction | Feb 13, 2024 | Contrastive Learning, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks | Oct 31, 2018 | Mixture-of-Experts | —Unverified | 0 | 0 |