| Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs | Oct 9, 2024 | Common Sense Reasoning, Mixture-of-Experts | —Unverified | 0 | 0 |
| Functional mixture-of-experts for classification | Feb 28, 2022 | Classification, Mixture-of-Experts | —Unverified | 0 | 0 |
| FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion | Feb 5, 2024 | Missing Elements, Mixture-of-Experts | —Unverified | 0 | 0 |
| FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | May 20, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding | Mar 24, 2025 | Mixture-of-Experts, Morphology classification | —Unverified | 0 | 0 |
| Gated Ensemble of Spatio-temporal Mixture of Experts for Multi-task Learning in Ride-hailing System | Dec 31, 2020 | Mixture-of-Experts, Multi-Task Learning | —Unverified | 0 | 0 |
| Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers | May 28, 2022 | Machine Translation, Mixture-of-Experts | —Unverified | 0 | 0 |
| GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input | Jun 1, 2021 | Mixture-of-Experts, named-entity-recognition | —Unverified | 0 | 0 |
| Generalizable Person Re-identification with Relevance-aware Mixture of Experts | May 19, 2021 | Generalizable Person Re-identification, Mixture-of-Experts | —Unverified | 0 | 0 |
| Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study | Mar 26, 2024 | Learning Theory, Mixture-of-Experts | —Unverified | 0 | 0 |
| Generalizing Multimodal Variational Methods to Sets | Dec 19, 2022 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Generator Assisted Mixture of Experts For Feature Acquisition in Batch | Dec 19, 2023 | Mixture-of-Experts | —Unverified | 0 | 0 |
| GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot | Mar 20, 2024 | Mixture-of-Experts, Multi-Task Learning | —Unverified | 0 | 0 |
| GETS: Ensemble Temperature Scaling for Calibration in Graph Neural Networks | Oct 12, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture | Jun 11, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| GLA in MediaEval 2018 Emotional Impact of Movies Task | Nov 27, 2019 | Mixture-of-Experts | —Unverified | 0 | 0 |
| GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | Dec 13, 2021 | Common Sense Reasoning, In-Context Learning | —Unverified | 0 | 0 |
| GM-MoE: Low-Light Enhancement with Gated-Mechanism Mixture-of-Experts | Mar 10, 2025 | 3D Reconstruction, Autonomous Driving | —Unverified | 0 | 0 |
| GradPower: Powering Gradients for Faster Language Model Pre-Training | May 30, 2025 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection | Dec 26, 2024 | Anomaly Detection, Mixture-of-Experts | —Unverified | 0 | 0 |
| GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism | Jan 14, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| GRIN: GRadient-INformed MoE | Sep 18, 2024 | HellaSwag, HumanEval | —Unverified | 0 | 0 |
| HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering | Apr 18, 2025 | Clustering, Graph Clustering | —Unverified | 0 | 0 |
| Half-Space Feature Learning in Neural Networks | Apr 5, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Hard Mixtures of Experts for Large Scale Weakly Supervised Vision | Apr 20, 2017 | GPU, Mixture-of-Experts | —Unverified | 0 | 0 |