SOTAVerified

Mixture-of-Experts

Papers

Showing 351–400 of 1312 papers

Title | Status | Hype
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding | Code | 0
Non-Normal Mixtures of Experts | Code | 0
Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Code | 0
Named Entity and Relation Extraction with Multi-Modal Retrieval | Code | 0
AskChart: Universal Chart Understanding through Textual Enhancement | Code | 0
Nesti-Net: Normal Estimation for Unstructured 3D Point Clouds using Convolutional Neural Networks | Code | 0
Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs | Code | 0
Multi-Source Domain Adaptation with Mixture of Experts | Code | 0
ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling | Code | 0
Multi-modal Collaborative Optimization and Expansion Network for Event-assisted Single-eye Expression Recognition | Code | 0
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules | Code | 0
A Gaussian Process-based Streaming Algorithm for Prediction of Time Series With Regimes and Outliers | Code | 0
Multimodal Cultural Safety: Evaluation Frameworks and Alignment Strategies | Code | 0
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts | Code | 0
A Gated Residual Kolmogorov-Arnold Networks for Mixtures of Experts | Code | 0
Completed Feature Disentanglement Learning for Multimodal MRIs Analysis | Code | 0
MoVEInt: Mixture of Variational Experts for Learning Human-Robot Interactions from Demonstrations | Code | 0
Multimodal Fusion Strategies for Mapping Biophysical Landscape Features | Code | 0
CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition | Code | 0
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition | Code | 0
MoRE-Brain: Routed Mixture of Experts for Interpretable and Generalizable Cross-Subject fMRI Visual Decoding | Code | 0
pFedMoE: Data-Level Personalization with Mixture of Experts for Model-Heterogeneous Personalized Federated Learning | Code | 0
FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Code | 0
MoNTA: Accelerating Mixture-of-Experts Training with Network-Traffc-Aware Parallel Optimization | Code | 0
More Experts Than Galaxies: Conditionally-overlapping Experts With Biologically-Inspired Fixed Routing | Code | 0
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation | Code | 0
MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling | Code | 0
CoLA: Collaborative Low-Rank Adaptation | Code | 0
Fast filtering of non-Gaussian models using Amortized Optimal Transport Maps | Code | 0
Mol-MoE: Training Preference-Guided Routers for Molecule Generation | Code | 0
Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments | Code | 0
On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists | Code | 0
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models | Code | 0
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models | Code | 0
MoE-I^2: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition | Code | 0
MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing | Code | 0
Extreme Classification in Log Memory using Count-Min Sketch: A Case Study of Amazon Search with 50M Products | Code | 0
Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models | Code | 0
Exploring Model Consensus to Generate Translation Paraphrases | Code | 0
A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training | Code | 0
Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning | Code | 0
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts | Code | 0
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion | Code | 0
MLP-KAN: Unifying Deep Representation and Function Learning | Code | 0
MoE-MLoRA for Multi-Domain CTR Prediction: Efficient Adaptation with Expert Specialization | Code | 0
CompeteSMoE - Effective Training of Sparse Mixture of Experts via Competition | Code | 0
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Code | 0
Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models | Code | 0
Mixture of Nested Experts: Adaptive Processing of Visual Tokens | Code | 0
Mixture of Link Predictors on Graphs | Code | 0
Page 8 of 27

No leaderboard results yet.