| Variational Distillation of Diffusion Policies into Mixture of Experts | Jun 18, 2024 | DenoisingMixture-of-Experts | —Unverified | 0 |
| GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Jun 18, 2024 | Code GenerationMathematical Problem-Solving | CodeCode Available | 0 |
| Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding | Jun 17, 2024 | Mixture-of-ExpertsNatural Language Understanding | CodeCode Available | 0 |
| Graph Knowledge Distillation to Mixture of Experts | Jun 17, 2024 | Knowledge DistillationMixture-of-Experts | CodeCode Available | 0 |
| Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction | Jun 14, 2024 | Mixture-of-ExpertsPrediction | —Unverified | 0 |
| Continual Traffic Forecasting via Mixture of Experts | Jun 5, 2024 | Continual LearningMixture-of-Experts | —Unverified | 0 |
| Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models | Jun 5, 2024 | Mixture-of-ExpertsTime Series | —Unverified | 0 |
| Style Mixture of Experts for Expressive Text-To-Speech Synthesis | Jun 5, 2024 | Mixture-of-ExpertsSpeech Synthesis | —Unverified | 0 |
| Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach | Jun 5, 2024 | Mixture-of-ExpertsNode Classification | —Unverified | 0 |
| A Gaussian Process-based Streaming Algorithm for Prediction of Time Series With Regimes and Outliers | Jun 1, 2024 | Gaussian ProcessesMixture-of-Experts | CodeCode Available | 0 |
| Optimizing 6G Integrated Sensing and Communications (ISAC) via Expert Networks | Jun 1, 2024 | ISACMixture-of-Experts | —Unverified | 0 |
| Training-efficient density quantum machine learning | May 30, 2024 | LEMMAMixture-of-Experts | —Unverified | 0 |
| Learning Mixture-of-Experts for General-Purpose Black-Box Discrete Optimization | May 29, 2024 | Mixture-of-Experts | CodeCode Available | 0 |
| MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors | May 29, 2024 | Mixture-of-ExpertsModel Editing | —Unverified | 0 |
| MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models | May 29, 2024 | DecoderGPU | —Unverified | 0 |
| LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design | May 28, 2024 | Mixture-of-Experts | —Unverified | 0 |
| A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts | May 26, 2024 | Binary ClassificationMixture-of-Experts | —Unverified | 0 |
| Expert-Token Resonance: Redefining MoE Routing through Affinity-Driven Active Selection | May 24, 2024 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 |
| Statistical Advantages of Perturbing Cosine Router in Mixture of Experts | May 23, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts | May 22, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Ensemble and Mixture-of-Experts DeepONets For Operator Learning | May 20, 2024 | Mixture-of-ExpertsOperator learning | CodeCode Available | 0 |
| Learning More Generalized Experts by Merging Experts in Mixture-of-Experts | May 19, 2024 | Incremental LearningMixture-of-Experts | —Unverified | 0 |
| Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts | May 16, 2024 | Dialogue State TrackingMixture-of-Experts | —Unverified | 0 |
| A Mixture of Experts Approach to 3D Human Motion Prediction | May 9, 2024 | Human motion predictionMixture-of-Experts | CodeCode Available | 0 |
| A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds | May 9, 2024 | Few-Shot LearningMixture-of-Experts | —Unverified | 0 |