
Mixture-of-Experts

Papers

Showing 401-450 of 1312 papers

Title | Status | Hype
MoE-LPR: Multilingual Extension of Large Language Models through Mixture-of-Experts with Language Priors Routing | Code | 0
MoE-MLoRA for Multi-Domain CTR Prediction: Efficient Adaptation with Expert Specialization | Code | 0
Expert Sample Consensus Applied to Camera Re-Localization | Code | 0
MoE-I^2: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition | Code | 0
A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models | Code | 0
Checkmating One, by Using Many: Combining Mixture of Experts with MCTS to Improve in Chess | Code | 0
Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning | Code | 0
MLP-KAN: Unifying Deep Representation and Function Learning | Code | 0
Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts | Code | 0
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules | Code | 0
Multimodal Fusion Strategies for Mapping Biophysical Landscape Features | Code | 0
Mixture of Link Predictors on Graphs | Code | 0
Anomaly Detection by Recombining Gated Unsupervised Experts | Code | 0
Mixture-of-Experts Variational Autoencoder for Clustering and Generating from Similarity-Based Representations on Single Cell Data | Code | 0
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Code | 0
Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs | Code | 0
Catching Attention with Automatic Pull Quote Selection | Code | 0
CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts | Code | 0
Ensemble and Mixture-of-Experts DeepONets For Operator Learning | Code | 0
Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection | Code | 0
Mixture of Experts Meets Decoupled Message Passing: Towards General and Adaptive Node Classification | Code | 0
Mixture of Nested Experts: Adaptive Processing of Visual Tokens | Code | 0
Mixture Content Selection for Diverse Sequence Generation | Code | 0
Adversarial Mixture Of Experts with Category Hierarchy Soft Constraint | Code | 0
An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference | Code | 0
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation | Code | 0
Build a Robust QA System with Transformer-based Mixture of Experts | Code | 0
Embarrassingly Parallel Inference for Gaussian Processes | Code | 0
Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern Estimation | Code | 0
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts | Code | 0
Eidetic Learning: an Efficient and Provable Solution to Catastrophic Forgetting | Code | 0
Manifold-Preserving Transformers are Effective for Short-Long Range Encoding | Code | 0
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts | Code | 0
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts | Code | 0
LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress? | Code | 0
Robust Federated Learning by Mixture of Experts | Code | 0
m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers | Code | 0
RouterKT: Mixture-of-Experts for Knowledge Tracing | Code | 0
Efficient and Interpretable Grammatical Error Correction with Mixture of Experts | Code | 0
Effective Approaches to Batch Parallelization for Dynamic Neural Network Architectures | Code | 0
Lifelong Mixture of Variational Autoencoders | Code | 0
Learning Mixture-of-Experts for General-Purpose Black-Box Discrete Optimization | Code | 0
Learning multi-modal generative models with permutation-invariant encoders and tighter variational objectives | Code | 0
EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization | Code | 0
Countering Mainstream Bias via End-to-End Adaptive Local Learning | Code | 0
SEKE: Specialised Experts for Keyword Extraction | Code | 0
A multi-scale lithium-ion battery capacity prediction using mixture of experts and patch-based MLP | Code | 0
DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism | Code | 0
Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models | Code | 0
A Multi-Modal Deep Learning Framework for Pan-Cancer Prognosis | Code | 0
Page 9 of 27

No leaderboard results yet.