Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–300 of 1312 papers

Title	Date	Tasks	Status	Hype
LMHaze: Intensity-aware Image Dehazing with a Large-scale Multi-intensity Real Haze Dataset	Oct 21, 2024	Image DehazingMamba	CodeCode Available	1
Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing	Jul 26, 2024	AttributeLanguage Modelling	CodeCode Available	1
BiMediX: Bilingual Medical Mixture of Experts LLM	Feb 20, 2024	Mixture-of-ExpertsMultiple-choice	CodeCode Available	1
LOLA -- An Open-Source Massively Multilingual Large Language Model	Sep 17, 2024	DiversityLanguage Modeling	CodeCode Available	1
Efficient Dictionary Learning with Switch Sparse Autoencoders	Oct 10, 2024	Dictionary LearningMixture-of-Experts	CodeCode Available	1
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs	Jul 1, 2024	GPUMixture-of-Experts	CodeCode Available	1
Lifting the Curse of Capacity Gap in Distilling Language Models	May 20, 2023	Knowledge DistillationMixture-of-Experts	CodeCode Available	1
Learning to Skip the Middle Layers of Transformers	Jun 26, 2025	Mixture-of-Experts	CodeCode Available	1
Efficient Fine-tuning of Audio Spectrogram Transformers via Soft Mixture of Adapters	Feb 1, 2024	Mixture-of-Expertsparameter-efficient fine-tuning	CodeCode Available	1
Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts	Feb 10, 2020	Language ModellingMixture-of-Experts	CodeCode Available	1
LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models	Apr 1, 2024	Decision MakingLanguage Modeling	CodeCode Available	1
Long-Tailed Visual Recognition via Self-Heterogeneous Integration with Knowledge Excavation	Apr 3, 2023	Mixture-of-ExpertsTransfer Learning	CodeCode Available	1
MedCoT: Medical Chain of Thought via Hierarchical Expert	Dec 18, 2024	DiagnosticMedical Visual Question Answering	CodeCode Available	1
RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling	May 14, 2021	Dialogue GenerationLanguage Modeling	CodeCode Available	1
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model	May 22, 2025	GPULong-range modeling	CodeCode Available	1
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment	Apr 27, 2024	Image Quality AssessmentMixture-of-Experts	CodeCode Available	1
DMT-HI: MOE-based Hyperbolic Interpretable Deep Manifold Transformation for Unspervised Dimensionality Reduction	Oct 25, 2024	Dimensionality ReductionMixture-of-Experts	CodeCode Available	1
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach	Oct 18, 2023	Blind Super-ResolutionDecoder	CodeCode Available	1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE	Feb 10, 2025	DiversityLanguage Modeling	CodeCode Available	1
Layerwise Recurrent Router for Mixture-of-Experts	Aug 13, 2024	AttributeMixture-of-Experts	CodeCode Available	1
HydraSum: Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models	Oct 8, 2021	Abstractive Text SummarizationDecoder	CodeCode Available	1
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts	Feb 20, 2024	Mixture-of-ExpertsMulti-Task Learning	CodeCode Available	1
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts	Dec 12, 2023	Mixture-of-Experts	CodeCode Available	1
Heterogeneous Multi-task Learning with Expert Diversity	Jun 20, 2021	DiversityMixture-of-Experts	CodeCode Available	1
Heterogeneous Mixture of Experts for Remote Sensing Image Super-Resolution	Feb 12, 2025	Image Super-ResolutionMixture-of-Experts	CodeCode Available	1
Graph Sparsification via Mixture of Graphs	May 23, 2024	Graph LearningMixture-of-Experts	CodeCode Available	1
Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identification	Apr 21, 2025	Exemplar-FreeKnowledge Distillation	CodeCode Available	1
HyperFormer: Enhancing Entity and Relation Interaction for Hyper-Relational Knowledge Graph Completion	Aug 12, 2023	AttributeKnowledge Graph Completion	CodeCode Available	1
Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution	Mar 27, 2022	Image Super-ResolutionMixture-of-Experts	CodeCode Available	1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling	Mar 2, 2024	Language ModellingLarge Language Model	CodeCode Available	1
Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation	Jan 24, 2025	Contrastive LearningMixture-of-Experts	CodeCode Available	1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss	Sep 9, 2021	Mixture-of-ExpertsRetrieval	CodeCode Available	1
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation	Oct 15, 2024	Explainable RecommendationLanguage Modelling	CodeCode Available	1
Distilling the Knowledge in a Neural Network	Mar 9, 2015	Knowledge DistillationMixture-of-Experts	CodeCode Available	1
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts	Nov 16, 2024	Mixture-of-ExpertsOptical Character Recognition (OCR)	CodeCode Available	1
Gated Multimodal Units for Information Fusion	Feb 7, 2017	General ClassificationGenre classification	CodeCode Available	1
Go Wider Instead of Deeper	Jul 25, 2021	Image ClassificationMixture-of-Experts	CodeCode Available	1
FreqMoE: Enhancing Time Series Forecasting through Frequency Decomposition Mixture of Experts	Jan 25, 2025	Mixture-of-ExpertsPrediction	CodeCode Available	1
DirectMultiStep: Direct Route Generation for Multi-Step Retrosynthesis	May 22, 2024	DiversityMixture-of-Experts	CodeCode Available	1
Learning Soccer Juggling Skills with Layer-wise Mixture-of-Experts	Jul 24, 2022	Deep Reinforcement LearningHumanoid Control	CodeCode Available	1
FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing	Dec 22, 2023	Mixture-of-ExpertsMotion Generation	CodeCode Available	1
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models	Nov 1, 2024	BenchmarkingMixture-of-Experts	CodeCode Available	1
FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	May 26, 2025	Mixture-of-Experts	CodeCode Available	1
Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts	Jun 17, 2024	Mixture-of-Experts	CodeCode Available	1
Frequency-Adaptive Pan-Sharpening with Mixture of Experts	Jan 4, 2024	Mixture-of-Experts	CodeCode Available	1
Gradient-free variational learning with conditional mixture networks	Aug 29, 2024	Computational EfficiencyMixture-of-Experts	CodeCode Available	1
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation	Oct 14, 2022	CPUMachine Translation	CodeCode Available	1
Specialized federated learning using a mixture of experts	Oct 5, 2020	Federated LearningMixture-of-Experts	CodeCode Available	1
LLMBind: A Unified Modality-Task Integration Framework	Feb 22, 2024	AI AgentAudio Generation	CodeCode Available	1
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis	Sep 7, 2023	Image GenerationMixture-of-Experts	CodeCode Available	1

Show:10 25 50

← PrevPage 6 of 27Next →

No leaderboard results yet.