SOTAVerified

Mixture-of-Experts

Papers

Showing 401–425 of 1312 papers

Title | Status | Hype
A Review of Sparse Expert Models in Deep Learning | | 0
HMoE: Heterogeneous Mixture of Experts for Language Modeling | | 0
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs | | 0
Complexity Experts are Task-Discriminative Learners for Any Image Restoration | | 0
Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations | | 0
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models | | 0
A Review of DeepSeek Models' Key Innovative Techniques | | 0
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts | | 0
HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization | | 0
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement | | 0
FloE: On-the-Fly MoE Inference on Memory-constrained GPU | | 0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving | | 0
FMT: A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework | | 0
ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation | | 0
Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models | | 0
FreqMoE: Dynamic Frequency Enhancement for Neural PDE Solvers | | 0
Affect in Tweets Using Experts Model | | 0
Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings | | 0
Fresh-CL: Feature Realignment through Experts on Hypersphere in Continual Learning | | 0
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape | | 0
ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL | | 0
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models | | 0
Continual Learning Using Task Conditional Neural Networks | | 0
Full-Precision Free Binary Graph Neural Networks | | 0
FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts | | 0
Page 17 of 53
