SOTAVerified

Mixture-of-Experts

Papers

Showing 451–500 of 1312 papers

Complexity Experts are Task-Discriminative Learners for Any Image Restoration
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models
A Review of DeepSeek Models' Key Innovative Techniques
AdaMV-MoE: Adaptive Multi-Task Vision Mixture-of-Experts
GRIN: GRadient-INformed MoE
Language-driven All-in-one Adverse Weather Removal
A Theoretical View on Sparsely Activated Networks
Large-Scale YouTube-8M Video Understanding with Deep Neural Networks
Learning More Generalized Experts by Merging Experts in Mixture-of-Experts
Affect in Tweets Using Experts Model
Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings
FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts
FedMoE-DA: Federated Mixture of Experts via Domain Aware Fine-grained Aggregation
KAT-V1: Kwai-AutoThink Technical Report
HDformer: A Higher Dimensional Transformer for Diabetes Detection Utilizing Long Range Vascular Signals
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs
FedMerge: Federated Personalization via Model Merging
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
Heuristic-Informed Mixture of Experts for Link Prediction in Multilayer Networks
KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation
Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models
Federated Mixture of Experts
Hierarchical Mixture-of-Experts Model for Large-Scale Gaussian Process Regression
Deep Gaussian Covariance Network
Federated learning using mixture of experts
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
FEAMOE: Fair, Explainable and Adaptive Mixture of Experts
Combining Parametric and Nonparametric Models for Off-Policy Evaluation
FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models
HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization
Combinations of Adaptive Filters
Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models
Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts
A Dynamic Approach to Stock Price Prediction: Comparing RNN and Mixture of Experts Models Across Different Volatility Profiles
LaDiMo: Layer-wise Distillation Inspired MoEfier
How Do Consumers Really Choose: Exposing Hidden Preferences with the Mixture of Experts Model
La-SoftMoE CLIP for Unified Physical-Digital Face Attack Detection
How Lightweight Can A Vision Transformer Be
Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought
Faster MoE LLM Inference for Extremely Large Models
Faster Language Models with Better Multi-Token Prediction Using Tensor Decomposition
CoCoAFusE: Beyond Mixtures of Experts via Model Fusion
Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective
An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Page 10 of 27
