| HDformer: A Higher Dimensional Transformer for Diabetes Detection Utilizing Long Range Vascular Signals | Mar 17, 2023 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs | Apr 4, 2025 | GPUMixture-of-Experts | —Unverified | 0 | 0 |
| Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks | Jan 29, 2025 | Link PredictionMixture-of-Experts | —Unverified | 0 | 0 |
| Hierarchical mixture of discriminative Generalized Dirichlet classifiers | May 2, 2024 | Mixture-of-ExpertsSpam detection | —Unverified | 0 | 0 |
| Hierarchical Mixture-of-Experts Model for Large-Scale Gaussian Process Regression | Dec 9, 2014 | Mixture-of-Expertsregression | —Unverified | 0 | 0 |
| Hierarchical Routing Mixture of Experts | Mar 18, 2019 | Mixture-of-Expertsregression | —Unverified | 0 | 0 |
| HiMoE: Heterogeneity-Informed Mixture-of-Experts for Fair Spatial-Temporal Forecasting | Nov 30, 2024 | FairnessMixture-of-Experts | —Unverified | 0 | 0 |
| HMoE: Heterogeneous Mixture of Experts for Language Modeling | Aug 20, 2024 | Computational EfficiencyLanguage Modeling | —Unverified | 0 | 0 |
| HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization | Nov 15, 2022 | Domain GeneralizationMixture-of-Experts | —Unverified | 0 | 0 |
| HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference | Nov 3, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models | Apr 9, 2025 | Instruction FollowingMathematical Problem-Solving | —Unverified | 0 | 0 |
| HoME: Hierarchy of Multi-Gate Experts for Multi-Task Learning at Kuaishou | Aug 10, 2024 | Mixture-of-ExpertsMulti-Task Learning | —Unverified | 0 | 0 |
| HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts | Nov 23, 2023 | Compositional Zero-Shot LearningMixture-of-Experts | —Unverified | 0 | 0 |
| How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? | May 1, 2022 | Entity TypingMixture-of-Experts | —Unverified | 0 | 0 |
| How Do Consumers Really Choose: Exposing Hidden Preferences with the Mixture of Experts Model | Mar 3, 2025 | Decision MakingDemand Forecasting | —Unverified | 0 | 0 |
| How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | Mar 4, 2024 | Few-Shot LearningLanguage Modeling | —Unverified | 0 | 0 |
| How Lightweight Can A Vision Transformer Be | Jul 25, 2024 | Mixture-of-ExpertsTransfer Learning | —Unverified | 0 | 0 |
| How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines | Feb 17, 2025 | Mixture-of-Experts | —Unverified | 0 | 0 |
| Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | May 21, 2025 | ChatbotInstruction Following | —Unverified | 0 | 0 |
| HydraSum - Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Sep 29, 2021 | Abstractive Text SummarizationDecoder | —Unverified | 0 | 0 |
| Hypertext Entity Extraction in Webpage | Mar 4, 2024 | Mixture-of-Experts | —Unverified | 0 | 0 |
| IDEA: An Inverse Domain Expert Adaptation Based Active DNN IP Protection Method | Sep 29, 2024 | Domain AdaptationMixture-of-Experts | —Unverified | 0 | 0 |
| Identifying Shopping Intent in Product QA for Proactive Recommendations | Apr 9, 2024 | FrictionMixture-of-Experts | —Unverified | 0 | 0 |
| iMedImage Technical Report | Mar 27, 2025 | Anomaly DetectionDiagnostic | —Unverified | 0 | 0 |
| Imitation Learning from MPC for Quadrupedal Multi-Gait Control | Mar 26, 2021 | Imitation LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Imitation Learning from Observations: An Autoregressive Mixture of Experts Approach | Nov 12, 2024 | Autonomous DrivingImitation Learning | —Unverified | 0 | 0 |
| Improved Training of Mixture-of-Experts Language GANs | Feb 23, 2023 | Adversarial TextImage Generation | —Unverified | 0 | 0 |
| Improving Coverage in Combined Prediction Sets with Weighted p-values | May 17, 2025 | Conformal PredictionMixture-of-Experts | —Unverified | 0 | 0 |
| Improving Expert Specialization in Mixture of Experts | Feb 28, 2023 | Continual LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Improving Sepsis Treatment Strategies by Combining Deep and Kernel-Based Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 | 0 |
| Incorporating Polar Field Data for Improved Solar Flare Prediction | Dec 4, 2022 | Mixture-of-ExpertsPrediction | —Unverified | 0 | 0 |
| Incorporating Visual Experts to Resolve the Information Loss in Multimodal Large Language Models | Jan 6, 2024 | Instruction FollowingMixture-of-Experts | —Unverified | 0 | 0 |
| Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures | May 14, 2025 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| Integrating AI's Carbon Footprint into Risk Management Frameworks: Strategies and Tools for Sustainable Compliance in Banking Sector | Sep 15, 2024 | Cloud ComputingManagement | —Unverified | 0 | 0 |
| Integrating Dynamical Systems Learning with Foundational Models: A Meta-Evolutionary AI Framework for Clinical Trials | May 25, 2025 | Evolutionary AlgorithmsLarge Language Model | —Unverified | 0 | 0 |
| Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey | Apr 25, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Intentional Biases in LLM Responses | Nov 11, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive | Jul 13, 2025 | CPUInteractive Segmentation | —Unverified | 0 | 0 |
| Generative AI Agents with Large Language Model for Satellite Networks via a Mixture of Experts Transmission | Apr 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction | Jun 14, 2024 | Mixture-of-ExpertsPrediction | —Unverified | 0 | 0 |
| Interpretable Mixture of Experts | Jun 5, 2022 | Decision MakingMixture-of-Experts | —Unverified | 0 | 0 |
| Interpretable mixture of experts for time series prediction under recurrent and non-recurrent conditions | Sep 5, 2024 | Mixture-of-ExpertsTime Series | —Unverified | 0 | 0 |
| Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning | Apr 13, 2024 | DiversityMixture-of-Experts | —Unverified | 0 | 0 |
| Investigating Mixture of Experts in Dense Retrieval | Dec 16, 2024 | Information RetrievalMixture-of-Experts | —Unverified | 0 | 0 |
| Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation | Jul 1, 2024 | Machine TranslationMixture-of-Experts | —Unverified | 0 | 0 |
| Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts? | Jan 25, 2024 | Mixture-of-Expertsparameter estimation | —Unverified | 0 | 0 |
| JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving | Jun 19, 2023 | In-Context LearningLanguage Modeling | —Unverified | 0 | 0 |
| Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient | Feb 7, 2025 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 | 0 |
| KAAE: Numerical Reasoning for Knowledge Graphs via Knowledge-aware Attributes Learning | Nov 20, 2024 | AttributeContrastive Learning | —Unverified | 0 | 0 |
| KAT-V1: Kwai-AutoThink Technical Report | Jul 11, 2025 | Knowledge DistillationLarge Language Model | —Unverified | 0 | 0 |