SOTAVerified

Mixture-of-Experts

Papers

Showing 751–800 of 1312 papers

Title | Status | Hype
ExpertRAG: Efficient RAG with Mixture of Experts -- Optimizing Context Retrieval for Adaptive LLM Responses | | 0
ExpertRank: A Multi-level Coarse-grained Expert-based Listwise Ranking Loss | | 0
Experts Weights Averaging: A New General Training Scheme for Vision Transformers | | 0
Expert-Token Resonance: Redefining MoE Routing through Affinity-Driven Active Selection | | 0
Explainable Classifier for Malignant Lymphoma Subtyping via Cell Graph and Image Fusion | | 0
Explainable data-driven modeling via mixture of experts: towards effective blending of grey and black-box models | | 0
Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities | | 0
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism | | 0
Exploring Routing Strategies for Multilingual Mixture-of-Experts Models | | 0
M6-T: Exploring Sparse Expert Models and Beyond | | 0
Exploring Speaker Diarization with Mixture of Experts | | 0
Facet-Aware Multi-Head Mixture-of-Experts Model for Sequential Recommendation | | 0
Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective | | 0
Faster Language Models with Better Multi-Token Prediction Using Tensor Decomposition | | 0
Faster MoE LLM Inference for Extremely Large Models | | 0
FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models | | 0
FEAMOE: Fair, Explainable and Adaptive Mixture of Experts | | 0
Federated learning using mixture of experts | | 0
Federated Mixture of Experts | | 0
FedMerge: Federated Personalization via Model Merging | | 0
FedMoE-DA: Federated Mixture of Experts via Domain Aware Fine-grained Aggregation | | 0
FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts | | 0
Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings | | 0
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models | | 0
Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations | | 0
FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs | | 0
FinTeamExperts: Role Specialized MOEs For Financial Analysis | | 0
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation | | 0
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models | | 0
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement | | 0
FloE: On-the-Fly MoE Inference on Memory-constrained GPU | | 0
fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving | | 0
FMT: A Multimodal Pneumonia Detection Model Based on Stacking MOE Framework | | 0
ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation | | 0
Free Agent in Agent-Based Mixture-of-Experts Generative AI Framework | | 0
FreqMoE: Dynamic Frequency Enhancement for Neural PDE Solvers | | 0
Fresh-CL: Feature Realignment through Experts on Hypersphere in Continual Learning | | 0
From Google Gemini to OpenAI Q* (Q-Star): A Survey of Reshaping the Generative Artificial Intelligence (AI) Research Landscape | | 0
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models | | 0
Full-Precision Free Binary Graph Neural Networks | | 0
Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs | | 0
Functional mixture-of-experts for classification | | 0
FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion | | 0
FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | | 0
Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding | | 0
Gated Ensemble of Spatio-temporal Mixture of Experts for Multi-task Learning in Ride-hailing System | | 0
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers | | 0
GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input | | 0
Generalizable Person Re-identification with Relevance-aware Mixture of Experts | | 0
Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study | | 0
Page 16 of 27

No leaderboard results yet.