SOTAVerified

Mixture-of-Experts

Papers

Showing 576–600 of 1312 papers

Title | Status | Hype
Mixture of Tunable Experts - Behavior Modification of DeepSeek-R1 at Inference Time | — | 0
ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models | — | 0
Probing Semantic Routing in Large Mixture-of-Expert Models | — | 0
Eidetic Learning: an Efficient and Provable Solution to Catastrophic Forgetting | Code | 0
Mixture of Decoupled Message Passing Experts with Entropy Constraint for General Node Classification | — | 0
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition | — | 0
Memory Analysis on the Training Course of DeepSeek Models | — | 0
MoENAS: Mixture-of-Expert based Neural Architecture Search for jointly Accurate, Fair, and Robust Edge Deep Neural Networks | — | 0
MoETuner: Optimized Mixture of Expert Serving with Balanced Expert Placement and Token Routing | — | 0
MoEMba: A Mamba-based Mixture of Experts for High-Density EMG-based Hand Gesture Recognition | — | 0
Klotski: Efficient Mixture-of-Expert Inference via Expert-Aware Multi-Batch Pipeline | Code | 0
Mol-MoE: Training Preference-Guided Routers for Molecule Generation | Code | 0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving | — | 0
Leveraging Pre-Trained Models for Multimodal Class-Incremental Learning under Adaptive Fusion | — | 0
Towards Foundational Models for Dynamical System Reconstruction: Hierarchical Meta-Learning via Mixture of Experts | — | 0
Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient | — | 0
Mixture of neural operator experts for learning boundary conditions and model selection | — | 0
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach | — | 0
ReGNet: Reciprocal Space-Aware Long-Range Modeling for Crystalline Property Prediction | — | 0
Brief analysis of DeepSeek R1 and it's implications for Generative AI | — | 0
M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference | — | 0
MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation | — | 0
CLIP-UP: A Simple and Efficient Mixture-of-Experts CLIP Training Recipe with Sparse Upcycling | — | 0
MergeME: Model Merging Techniques for Homogeneous and Heterogeneous MoEs | — | 0
Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective | Code | 0
Page 24 of 53

No leaderboard results yet.