
Mixture-of-Experts

Papers

Showing 201–225 of 1312 papers

Title | Status | Hype
MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering | Code | 1
Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning | Code | 1
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing | Code | 1
RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | Code | 1
Layerwise Recurrent Router for Mixture-of-Experts | Code | 1
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models | Code | 1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Code | 1
Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach | Code | 1
Mixture-of-Linear-Experts for Long-term Time Series Forecasting | Code | 1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Code | 1
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model | Code | 1
HyperFormer: Enhancing Entity and Relation Interaction for Hyper-Relational Knowledge Graph Completion | Code | 1
Modality Interactive Mixture-of-Experts for Fake News Detection | Code | 1
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Code | 1
HydraSum: Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Code | 1
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Code | 1
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts | Code | 1
Deep learning techniques for blind image super-resolution: A high-scale multi-domain perspective evaluation | Code | 1
HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts | Code | 1
Lifting the Curse of Capacity Gap in Distilling Language Models | Code | 1
Heterogeneous Multi-task Learning with Expert Diversity | Code | 1
Heterogeneous Mixture of Experts for Remote Sensing Image Super-Resolution | Code | 1
BrainMAP: Learning Multiple Activation Pathways in Brain Networks | Code | 1
Graph Sparsification via Mixture of Graphs | Code | 1
Hierarchical Time-Aware Mixture of Experts for Multi-Modal Sequential Recommendation | Code | 1
Page 9 of 53
