SOTAVerified

Mixture-of-Experts

Papers

Showing 351–375 of 1312 papers

Title | Status | Hype
GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with Guided Selection Vectors | Code | 0
Scaling Intelligence: Designing Data Centers for Next-Gen Language Models | - | 0
Single-Example Learning in a Mixture of GPDMs with Latent Geometries | - | 0
Load Balancing Mixture of Experts with Similarity Preserving Routers | - | 0
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization | Code | 0
Serving Large Language Models on Huawei CloudMatrix384 | - | 0
Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts | - | 0
GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture | - | 0
MedMoE: Modality-Specialized Mixture of Experts for Medical Vision-Language Understanding | - | 0
A Two-Phase Deep Learning Framework for Adaptive Time-Stepping in High-Speed Flow Modeling | - | 0
MoE-GPS: Guidlines for Prediction Strategy for Dynamic Expert Duplication in MoE Load Balancing | - | 0
M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration | - | 0
MoE-MLoRA for Multi-Domain CTR Prediction: Efficient Adaptation with Expert Specialization | Code | 0
MIRA: Medical Time Series Foundation Model for Real-World Health Data | - | 0
STAMImputer: Spatio-Temporal Attention MoE for Traffic Data Imputation | Code | 0
Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning | - | 0
SMAR: Soft Modality-Aware Routing Strategy for MoE-based Multimodal Large Language Models Preserving Language Capabilities | - | 0
Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection | - | 0
Brain-Like Processing Pathways Form in Models With Heterogeneous Experts | - | 0
Enhancing Multimodal Continual Instruction Tuning with BranchLoRA | - | 0
Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis | - | 0
GradPower: Powering Gradients for Faster Language Model Pre-Training | - | 0
On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks | - | 0
Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction | - | 0
From Knowledge to Noise: CTIM-Rover and the Pitfalls of Episodic Memory in Software Engineering Agents | Code | 0
Page 15 of 53

Leaderboard

No leaderboard results yet.