| EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing | Oct 2, 2024 | Image GenerationMixture-of-Experts | —Unverified | 0 |
| The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs | Oct 2, 2024 | BenchmarkingHallucination | —Unverified | 0 |
| Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models | Oct 2, 2024 | Mixture-of-ExpertsNavigate | CodeCode Available | 2 |
| Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging | Oct 2, 2024 | DiversityMixture-of-Experts | —Unverified | 0 |
| MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards | Oct 1, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| UniAdapt: A Universal Adapter for Knowledge Calibration | Oct 1, 2024 | Mixture-of-ExpertsModel Editing | —Unverified | 0 |
| Robust Traffic Forecasting against Spatial Shift over Years | Oct 1, 2024 | AttributeMixture-of-Experts | CodeCode Available | 0 |
| MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning | Sep 30, 2024 | Mixture-of-ExpertsOptical Character Recognition (OCR) | —Unverified | 0 |
| IDEA: An Inverse Domain Expert Adaptation Based Active DNN IP Protection Method | Sep 29, 2024 | Domain AdaptationMixture-of-Experts | —Unverified | 0 |
| CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling | Sep 28, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| SciDFM: A Large Language Model with Mixture-of-Experts for Science | Sep 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction | Sep 26, 2024 | Mixture-of-ExpertsPrediction | CodeCode Available | 1 |
| Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Sep 26, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts | Sep 24, 2024 | Computational EfficiencyMixture-of-Experts | CodeCode Available | 4 |
| Toward Mixture-of-Experts Enabled Trustworthy Semantic Communication for 6G Networks | Sep 24, 2024 | Mixture-of-ExpertsSemantic Communication | —Unverified | 0 |
| Leveraging Mixture of Experts for Improved Speech Deepfake Detection | Sep 24, 2024 | DeepFake DetectionFace Swapping | —Unverified | 0 |
| Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond | Sep 23, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| A Gated Residual Kolmogorov-Arnold Networks for Mixtures of Experts | Sep 23, 2024 | Kolmogorov-Arnold NetworksMixture-of-Experts | CodeCode Available | 0 |
| Routing in Sparsely-gated Language Models responds to Context | Sep 21, 2024 | DecoderMixture-of-Experts | —Unverified | 0 |
| Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning | Sep 20, 2024 | Data IntegrationMixture-of-Experts | —Unverified | 0 |
| On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists | Sep 20, 2024 | Federated LearningLanguage Modeling | CodeCode Available | 0 |
| Robust Audiovisual Speech Recognition Models with Mixture-of-Experts | Sep 19, 2024 | Mixture-of-ExpertsRobust Speech Recognition | —Unverified | 0 |
| Mixture of Diverse Size Experts | Sep 18, 2024 | Mixture-of-Experts | —Unverified | 0 |
| GRIN: GRadient-INformed MoE | Sep 18, 2024 | HellaSwagHumanEval | —Unverified | 0 |