SOTAVerified

Mixture-of-Experts

Papers

Showing 451–500 of 1312 papers

| Title | Status | Hype |
| --- | --- | --- |
| MoE-I^2: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition | Code | 0 |
| MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffic-Aware Parallel Optimization | Code | 0 |
| LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models | Code | 1 |
| Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | | 0 |
| Efficient and Interpretable Grammatical Error Correction with Mixture of Experts | Code | 0 |
| MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning | | 0 |
| Stealing User Prompts from Mixture of Experts | | 0 |
| Neural Experts: Mixture of Experts for Implicit Neural Representations | | 0 |
| ProMoE: Fast MoE-based LLM Serving using Proactive Caching | | 0 |
| Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging | | 0 |
| FinTeamExperts: Role Specialized MOEs For Financial Analysis | | 0 |
| Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving | | 0 |
| Hierarchical Mixture of Experts: Generalizable Learning for High-Level Synthesis | Code | 0 |
| DMT-HI: MOE-based Hyperbolic Interpretable Deep Manifold Transformation for Unsupervised Dimensionality Reduction | Code | 1 |
| Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design | Code | 1 |
| Mixture of Parrots: Experts improve memorization more than reasoning | | 0 |
| MoMQ: Mixture-of-Experts Enhances Multi-Dialect Query Generation across Relational and Non-Relational Databases | | 0 |
| Robust and Explainable Depression Identification from Speech Using Vowel-Based Ensemble Learning Approaches | | 0 |
| MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | | 0 |
| Faster Language Models with Better Multi-Token Prediction Using Tensor Decomposition | | 0 |
| ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | | 0 |
| Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling | | 0 |
| Generalizing Motion Planners with Mixture of Experts for Autonomous Driving | Code | 3 |
| CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts | Code | 0 |
| LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset | Code | 1 |
| ViMoE: An Empirical Study of Designing Vision Mixture-of-Experts | | 0 |
| LoRA-IR: Taming Low-Rank Experts for Efficient All-in-One Image Restoration | Code | 2 |
| MENTOR: Mixture-of-Experts Network with Task-Oriented Perturbation for Visual Reinforcement Learning | | 0 |
| MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts | Code | 1 |
| ST-MoE-BERT: A Spatial-Temporal Mixture-of-Experts Framework for Long-Term Cross-City Mobility Prediction | Code | 1 |
| Enhancing Generalization in Sparse Mixture of Experts Models: The Case for Increased Expert Activation in Compositional Tasks | | 0 |
| On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs | | 0 |
| EPS-MoE: Expert Pipeline Scheduler for Cost-Efficient MoE Inference | | 0 |
| Understanding Expert Structures on Minimax Parameter Estimation in Contaminated Mixture of Experts | | 0 |
| Transformer Layer Injection: A Novel Approach for Efficient Upscaling of Large Language Models | | 0 |
| MoE-Pruner: Pruning Mixture-of-Experts Large Language Model using the Hints from Its Router | | 0 |
| MoH: Multi-Head Attention as Mixture-of-Head Attention | Code | 4 |
| Quadratic Gating Functions in Mixture of Experts: A Statistical Insight | | 0 |
| GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation | Code | 1 |
| AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality | Code | 1 |
| Ada-K Routing: Boosting the Efficiency of MoE-based LLMs | | 0 |
| Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts | Code | 2 |
| Learning to Ground VLMs without Forgetting | | 0 |
| Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free | Code | 2 |
| Scalable Multi-Domain Adaptation of Language Models using Modular Experts | | 0 |
| Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models | Code | 1 |
| Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts | Code | 5 |
| ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL | | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | | 0 |
| AT-MoE: Adaptive Task-planning Mixture of Experts via LoRA Approach | | 0 |
Page 10 of 27