| A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts | May 26, 2024 | Binary Classification, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Review of DeepSeek Models' Key Innovative Techniques | Mar 14, 2025 | Mixture-of-Experts, reinforcement-learning | —Unverified | 0 | 0 |
| A Review of Sparse Expert Models in Deep Learning | Sep 4, 2022 | Deep Learning, Mixture-of-Experts | —Unverified | 0 | 0 |
| A similarity-based Bayesian mixture-of-experts model | Dec 3, 2020 | Mixture-of-Experts, model | —Unverified | 0 | 0 |
| A Simple Architecture for Enterprise Large Language Model Applications based on Role based security and Clearance Levels using Retrieval-Augmented Generation or Mixture of Experts | Jul 9, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Improving Transformer Performance for French Clinical Notes Classification Using Mixture of Experts on a Limited Dataset | Mar 22, 2023 | Mixture-of-Experts, text-classification | —Unverified | 0 | 0 |
| Astrea: A MOE-based Visual Understanding Model with Progressive Alignment | Mar 12, 2025 | Contrastive Learning, Cross-Modal Retrieval | —Unverified | 0 | 0 |
| A Survey on Dynamic Neural Networks for Natural Language Processing | Feb 15, 2022 | Dynamic neural networks, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning | Aug 13, 2024 | Mixture-of-Experts, Survey | —Unverified | 0 | 0 |
| A Theoretical View on Sparsely Activated Networks | Aug 8, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| AT-MoE: Adaptive Task-planning Mixture of Experts via LoRA Approach | Oct 12, 2024 | Mixture-of-Experts, Task Planning | —Unverified | 0 | 0 |
| A Tree Architecture of LSTM Networks for Sequential Regression with Missing Data | May 22, 2020 | Mixture-of-Experts, regression | —Unverified | 0 | 0 |
| Attention Weighted Mixture of Experts with Contrastive Learning for Personalized Ranking in E-commerce | Jun 8, 2023 | Contrastive Learning, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling | Jun 9, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| A Unified Approach to Universal Prediction: Generalized Upper and Lower Bounds | Nov 25, 2013 | Learning Theory, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Unified Framework for Iris Anti-Spoofing: Introducing IrisGeneral Dataset and Masked-MoE Method | Aug 19, 2024 | Iris Recognition, Mixture-of-Experts | —Unverified | 0 | 0 |
| A Unified Virtual Mixture-of-Experts Framework: Enhanced Inference and Hallucination Mitigation in Single-Model System | Apr 1, 2025 | Dialogue Generation, Ensemble Learning | —Unverified | 0 | 0 |
| A Universal Approximation Theorem for Mixture of Experts Models | Feb 11, 2016 | General Classification, Mixture-of-Experts | —Unverified | 0 | 0 |
| Automatically Extracting Information in Medical Dialogue: Expert System And Attention for Labelling | Nov 28, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Automatic Document Sketching: Generating Drafts from Analogous Texts | Jun 14, 2021 | Mixture-of-Experts, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Automatic Expert Selection for Multi-Scenario and Multi-Task Search | May 28, 2022 | Mixture-of-Experts, Multi-Task Learning | —Unverified | 0 | 0 |
| Automatic Operator-level Parallelism Planning for Distributed Deep Learning -- A Mixed-Integer Programming Approach | Mar 12, 2025 | Computational Efficiency, Mixture-of-Experts | —Unverified | 0 | 0 |
| 類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識 (Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese] | Oct 1, 2015 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Autonomy-of-Experts Models | Jan 22, 2025 | Decision Making, Mixture-of-Experts | —Unverified | 0 | 0 |
| Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts | Aug 28, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |