| Condensing Multilingual Knowledge with Lightweight Language-Specific Modules | May 23, 2023 | Machine TranslationMixture-of-Experts | CodeCode Available | 0 |
| Completed Feature Disentanglement Learning for Multimodal MRIs Analysis | Jul 6, 2024 | DisentanglementMixture-of-Experts | CodeCode Available | 0 |
| Skeleton-Based Human Action Recognition with Noisy Labels | Mar 15, 2024 | Action RecognitionDenoising | CodeCode Available | 0 |
| UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity | Dec 28, 2024 | Image RestorationMixture-of-Experts | CodeCode Available | 0 |
| Manifold-Preserving Transformers are Effective for Short-Long Range Encoding | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving | Jul 19, 2025 | Autonomous DrivingBench2Drive | CodeCode Available | 0 |
| FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models | Oct 3, 2023 | Face TransferMixture-of-Experts | CodeCode Available | 0 |
| From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents | May 29, 2025 | AI AgentMixture-of-Experts | CodeCode Available | 0 |
| A Gaussian Process-based Streaming Algorithm for Prediction of Time Series With Regimes and Outliers | Jun 1, 2024 | Gaussian ProcessesMixture-of-Experts | CodeCode Available | 0 |
| Anomaly Detection by Recombining Gated Unsupervised Experts | Aug 31, 2020 | Anomaly DetectionMixture-of-Experts | CodeCode Available | 0 |
| SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Dec 17, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Finger Pose Estimation for Under-screen Fingerprint Sensor | May 5, 2025 | Mixture-of-ExpertsPose Estimation | CodeCode Available | 0 |
| pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning | Feb 2, 2024 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Aug 17, 2024 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers | Feb 26, 2024 | Knowledge DistillationMixture-of-Experts | CodeCode Available | 0 |
| BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing | Dec 24, 2024 | Decision MakingFace Anti-Spoofing | CodeCode Available | 0 |
| Fast filtering of non-Gaussian models using Amortized Optimal Transport Maps | Mar 16, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| A Gated Residual Kolmogorov-Arnold Networks for Mixtures of Experts | Sep 23, 2024 | Kolmogorov-Arnold NetworksMixture-of-Experts | CodeCode Available | 0 |
| Bidirectional Attention as a Mixture of Continuous Word Experts | Jul 8, 2023 | Language ModellingMixture-of-Experts | CodeCode Available | 0 |
| Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy | Sep 11, 2021 | Machine TranslationMixture-of-Experts | CodeCode Available | 0 |
| Tight Clusters Make Specialized Experts | Feb 21, 2025 | ClusteringLanguage Modeling | CodeCode Available | 0 |
| CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition | May 19, 2025 | Mixture-of-Experts | CodeCode Available | 0 |
| Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors | Apr 2, 2024 | Data PoisoningHate Speech Detection | CodeCode Available | 0 |
| LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? | May 7, 2025 | Large Language ModelMixture-of-Experts | CodeCode Available | 0 |
| FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models | Aug 15, 2024 | Mixture-of-Experts | CodeCode Available | 0 |