Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–175 of 1312 papers

Title	Date	Tasks	Status	Hype
Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts	Oct 8, 2022	Domain GeneralizationKnowledge Distillation	CodeCode Available	1
Mixture of Experts Made Personalized: Federated Prompt Learning for Vision-Language Models	Oct 14, 2024	Federated LearningMixture-of-Experts	CodeCode Available	1
M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis	Jul 24, 2024	Mixture-of-ExpertsMultiple Instance Learning	CodeCode Available	1
M3oE: Multi-Domain Multi-Task Mixture-of Experts Recommendation Framework	Apr 29, 2024	AutoMLMixture-of-Experts	CodeCode Available	1
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition	Apr 7, 2022	Mixture-of-Expertsspeech-recognition	CodeCode Available	1
M^3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design	Oct 26, 2022	Mixture-of-ExpertsMulti-Task Learning	CodeCode Available	1
M^4oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts	May 15, 2024	Image SegmentationMixture-of-Experts	CodeCode Available	1
LOLA -- An Open-Source Massively Multilingual Large Language Model	Sep 17, 2024	DiversityLanguage Modeling	CodeCode Available	1
LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset	Oct 21, 2024	Image DehazingMamba	CodeCode Available	1
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation	Apr 3, 2023	Mixture-of-ExpertsTransfer Learning	CodeCode Available	1
LLMBind: A Unified Modality-Task Integration Framework	Feb 22, 2024	AI AgentAudio Generation	CodeCode Available	1
AdapMoE: Adaptive Sensitivity-based Expert Gating and Management for Efficient MoE Inference	Aug 19, 2024	ManagementMixture-of-Experts	CodeCode Available	1
LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models	Sep 25, 2023	GPUMixture-of-Experts	CodeCode Available	1
LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models	Apr 1, 2024	Decision MakingLanguage Modeling	CodeCode Available	1
AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies	Aug 13, 2024	Language ModellingMixture-of-Experts	CodeCode Available	1
Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning	Nov 26, 2024	Mixture-of-Experts	CodeCode Available	1
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models	Nov 1, 2024	BenchmarkingMixture-of-Experts	CodeCode Available	1
AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models	Jun 19, 2024	ARCMixture-of-Experts	CodeCode Available	1
COMET: Learning Cardinality Constrained Mixture of Experts with Trees and Local Search	Jun 5, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Lifting the Curse of Capacity Gap in Distilling Language Models	May 20, 2023	Knowledge DistillationMixture-of-Experts	CodeCode Available	1
Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction	Aug 26, 2020	Interpretable Machine LearningMixture-of-Experts	CodeCode Available	1
Mixture of Experts Meets Prompt-Based Continual Learning	May 23, 2024	Continual LearningMixture-of-Experts	CodeCode Available	1
Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts	Feb 10, 2020	Language ModellingMixture-of-Experts	CodeCode Available	1
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment	Apr 27, 2024	Image Quality AssessmentMixture-of-Experts	CodeCode Available	1
Layerwise Recurrent Router for Mixture-of-Experts	Aug 13, 2024	AttributeMixture-of-Experts	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 53Next →

No leaderboard results yet.