| OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning | Jan 17, 2025 | Computational EfficiencyDiversity | —Unverified | 0 |
| LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Jan 16, 2025 | Mixture-of-ExpertsWorld Knowledge | —Unverified | 0 |
| MiniMax-01: Scaling Foundation Models with Lightning Attention | Jan 14, 2025 | Mixture-of-Experts | CodeCode Available | 7 |
| PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration | Jan 14, 2025 | Mixture-of-ExpertsPoint Cloud Registration | —Unverified | 0 |
| GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism | Jan 14, 2025 | Mixture-of-Experts | —Unverified | 0 |
| A Multi-Modal Deep Learning Framework for Pan-Cancer Prognosis | Jan 13, 2025 | Deep LearningMixture-of-Experts | CodeCode Available | 0 |
| Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning | Jan 12, 2025 | Mixture-of-ExpertsMulti-Task Learning | CodeCode Available | 1 |
| TAMER: A Test-Time Adaptive MoE-Driven Framework for EHR Representation Learning | Jan 10, 2025 | Mixture-of-ExpertsRepresentation Learning | CodeCode Available | 0 |
| Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing | Jan 9, 2025 | Bayesian OptimizationCPU | —Unverified | 0 |
| LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes | Jan 7, 2025 | Mixture-of-ExpertsRepresentation Learning | CodeCode Available | 2 |
| mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training | Jan 7, 2025 | BlockingGPU | —Unverified | 0 |
| Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection | Jan 6, 2025 | Decision MakingMixture-of-Experts | CodeCode Available | 0 |
| Fresh-CL: Feature Realignment through Experts on Hypersphere in Continual Learning | Jan 4, 2025 | Continual LearningMixture-of-Experts | —Unverified | 0 |
| MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders | Jan 3, 2025 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 |
| UNIALIGN: Scaling Multimodal Alignment within One Unified Model | Jan 1, 2025 | Mixture-of-Experts | —Unverified | 0 |
| Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images | Jan 1, 2025 | Mixture-of-Expertswhole slide images | —Unverified | 0 |
| Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning | Jan 1, 2025 | image-classificationImage Classification | —Unverified | 0 |
| Towards Efficient Foundation Model for Zero-shot Amodal Segmentation | Jan 1, 2025 | Mixture-of-Experts | —Unverified | 0 |
| MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification | Jan 1, 2025 | image-classificationImage Classification | —Unverified | 0 |
| REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization | Jan 1, 2025 | Mixture-of-Experts | —Unverified | 0 |
| CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection | Dec 31, 2024 | Anomaly DetectionAttribute | —Unverified | 0 |
| Superposition in Transformers: A Novel Way of Building Mixture of Experts | Dec 31, 2024 | Mixture-of-Experts | CodeCode Available | 2 |
| Multimodal Variational Autoencoder: a Barycentric View | Dec 29, 2024 | Mixture-of-ExpertsRepresentation Learning | —Unverified | 0 |
| UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity | Dec 28, 2024 | Image RestorationMixture-of-Experts | CodeCode Available | 0 |
| DeepSeek-V3 Technical Report | Dec 27, 2024 | GPULanguage Modeling | CodeCode Available | 16 |