| Fast filtering of non-Gaussian models using Amortized Optimal Transport Maps | Mar 16, 2025 | Mixture-of-Experts | Code Available | 0 |
| Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection | Mar 15, 2025 | Mixture-of-Experts | Unverified | 0 |
| MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling | Mar 14, 2025 | Mixture-of-Experts, parameter-efficient fine-tuning | Code Available | 0 |
| A Review of DeepSeek Models' Key Innovative Techniques | Mar 14, 2025 | Mixture-of-Experts, reinforcement-learning | Unverified | 0 |
| Ensemble Learning for Large Language Models in Text and Code Generation: A Survey | Mar 13, 2025 | Code Generation, Ensemble Learning | Unverified | 0 |
| dFLMoE: Decentralized Federated Learning via Mixture of Experts for Medical Data Analysis | Mar 13, 2025 | Federated Learning, Mixture-of-Experts | Unverified | 0 |
| Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference | Mar 12, 2025 | Blocking, GPU | Unverified | 0 |
| Astrea: A MOE-based Visual Understanding Model with Progressive Alignment | Mar 12, 2025 | Contrastive Learning, Cross-Modal Retrieval | Unverified | 0 |
| FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models | Mar 12, 2025 | Mixture-of-Experts, Question Answering | Unverified | 0 |
| Towards Robust Multimodal Representation: A Unified Approach with Adaptive Experts and Alignment | Mar 12, 2025 | Contrastive Learning, Decision Making | Code Available | 0 |