SOTAVerified

Mixture-of-Experts

Papers

Showing 801–850 of 1312 papers

Title | Status | Hype
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning | | 0
HoME: Hierarchy of Multi-Gate Experts for Multi-Task Learning at Kuaishou | | 0
LaDiMo: Layer-wise Distillation Inspired MoEfier | | 0
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning | Code | 0
MoC-System: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training | | 0
Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization | | 0
HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction | | 0
Multimodal Fusion and Coherence Modeling for Video Topic Segmentation | | 0
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts | | 0
PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning | | 0
Distribution Learning for Molecular Regression | | 0
Time series forecasting with high stakes: A field study of the air cargo industry | | 0
Mixture of Nested Experts: Adaptive Processing of Visual Tokens | Code | 0
Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Code | 0
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition | Code | 0
Wolf: Captioning Everything with a World Summarization Framework | | 0
How Lightweight Can A Vision Transformer Be | | 0
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism | | 0
Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks | | 0
EEGMamba: Bidirectional State Space Model with Mixture of Experts for EEG Multi-task Classification | | 0
EVLM: An Efficient Vision-Language Model for Visual Understanding | | 0
Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | | 0
Mixture of Experts based Multi-task Supervise Learning from Crowds | | 0
Discussion: Effective and Interpretable Outcome Prediction by Training Sparse Mixtures of Linear Experts | | 0
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration | | 0
Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | | 0
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts | Code | 0
Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts | | 0
An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio | | 0
MoVEInt: Mixture of Variational Experts for Learning Human-Robot Interactions from Demonstrations | Code | 0
A Simple Architecture for Enterprise Large Language Model Applications based on Role based security and Clearance Levels using Retrieval-Augmented Generation or Mixture of Experts | | 0
SAM-Med3D-MoE: Towards a Non-Forgetting Segment Anything Model via Mixture of Experts for 3D Medical Image Segmentation | | 0
Completed Feature Disentanglement Learning for Multimodal MRIs Analysis | Code | 0
MobileFlow: A Multimodal LLM For Mobile GUI Agent | | 0
Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | | 0
Terminating Differentiable Tree Experts | | 0
Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation | | 0
Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning | | 0
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | | 0
A Teacher Is Worth A Million Instructions | Code | 0
Towards Personalized Federated Multi-Scenario Multi-Task Recommendation | | 0
SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR | | 0
Mixture of Experts in a Mixture of RL settings | | 0
MoESD: Mixture of Experts Stable Diffusion to Mitigate Gender Bias | | 0
Peirce in the Machine: How Mixture of Experts Models Perform Hypothesis Construction | Code | 0
OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser | Code | 0
Theory on Mixture-of-Experts in Continual Learning | | 0
SimSMoE: Solving Representational Collapse via Similarity Measure | | 0
Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation | | 0
P-Tailor: Customizing Personality Traits for Language Models via Mixture of Specialized LoRA Experts | | 0
Page 17 of 27

No leaderboard results yet.