Mixture-of-Experts

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 451–500 of 1312 papers

Title	Date	Tasks	Status
MoxE: Mixture of xLSTM Experts with Entropy-Aware Routing for Efficient Language Modeling	May 1, 2025	Language ModelingLanguage Modelling	—Unverified
CICADA: Cross-Domain Interpretable Coding for Anomaly Detection and Adaptation in Multivariate Time Series	May 1, 2025	Anomaly DetectionMeta-Learning	—Unverified
MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation	Apr 29, 2025	cross-modal alignmentDecoder	CodeCode Available
Accelerating Mixture-of-Experts Training with Adaptive Expert Replication	Apr 28, 2025	GPUMixture-of-Experts	—Unverified
PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight	Apr 26, 2025	Mixture-of-ExpertsPICO	—Unverified
NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation	Apr 25, 2025	Code CompletionMixture-of-Experts	—Unverified
Unveiling the Hidden: Movie Genre and User Bias in Spoiler Detection	Apr 24, 2025	Graph AttentionMixture-of-Experts	CodeCode Available
BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts	Apr 24, 2025	Backdoor AttackMixture-of-Experts	—Unverified
MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core	Apr 21, 2025	Mixture-of-Experts	—Unverified
Multi-Type Context-Aware Conversational Recommender Systems via Mixture-of-Experts	Apr 18, 2025	Mixture-of-ExpertsRecommendation Systems	—Unverified
HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering	Apr 18, 2025	ClusteringGraph Clustering	—Unverified
D^2MoE: Dual Routing and Dynamic Scheduling for Efficient On-Device MoE-based LLM Serving	Apr 17, 2025	Mixture-of-ExpertsModel Compression	—Unverified
Trend Filtered Mixture of Experts for Automated Gating of High-Frequency Flow Cytometry Data	Apr 16, 2025	Mixture-of-Experts	—Unverified
Unveiling Hidden Collaboration within Mixture-of-Experts in Large Language Models	Apr 16, 2025	Dictionary LearningMixture-of-Experts	—Unverified
Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming	Apr 14, 2025	Mixture-of-Experts	—Unverified
Mixture-of-Shape-Experts (MoSE): End-to-End Shape Dictionary Framework to Prompt SAM for Generalizable Medical Segmentation	Apr 13, 2025	Dictionary LearningDomain Generalization	—Unverified
MoE-Lens: Towards the Hardware Limit of High-Throughput MoE LLM Serving Under Resource Constraints	Apr 12, 2025	CPUGPU	—Unverified
RouterKT: Mixture-of-Experts for Knowledge Tracing	Apr 11, 2025	Knowledge TracingMixture-of-Experts	CodeCode Available
Regularized infill criteria for multi-objective Bayesian optimization with application to aircraft design	Apr 11, 2025	Bayesian Optimizationglobal-optimization	—Unverified
Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network	Apr 10, 2025	Mixture-of-Expertsobject-detection	—Unverified
Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models	Apr 10, 2025	Mixture-of-Experts	—Unverified
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning	Apr 10, 2025	Mixture-of-Expertsreinforcement-learning	—Unverified
Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models	Apr 10, 2025	Computational EfficiencyMixture-of-Experts	CodeCode Available
Holistic Capability Preservation: Towards Compact Yet Comprehensive Reasoning Models	Apr 9, 2025	Instruction FollowingMathematical Problem-Solving	—Unverified
FedMerge: Federated Personalization via Model Merging	Apr 9, 2025	Federated LearningMixture-of-Experts	—Unverified
Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations	Apr 8, 2025	Instruction FollowingMixture-of-Experts	—Unverified
HeterMoE: Efficient Training of Mixture-of-Experts Models on Heterogeneous GPUs	Apr 4, 2025	GPUMixture-of-Experts	—Unverified
RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation	Apr 4, 2025	Change DetectionDepth Estimation	—Unverified
MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism	Apr 3, 2025	CPUGPU	—Unverified
Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design	Apr 2, 2025	AttributeMixture-of-Experts	—Unverified
A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System	Apr 1, 2025	Dialogue GenerationEnsemble Learning	—Unverified
DynMoLE: Boosting Mixture of LoRA Experts Fine-Tuning with a Hybrid Routing Mechanism	Apr 1, 2025	Common Sense ReasoningComputational Efficiency	CodeCode Available
Detecting Financial Fraud with Hybrid Deep Learning: A Mix-of-Experts Approach to Sequential and Anomalous Patterns	Apr 1, 2025	Fraud DetectionMixture-of-Experts	—Unverified
Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion	Mar 31, 2025	Emotion RecognitionKnowledge Distillation	—Unverified
Mixture of Routers	Mar 30, 2025	Mixture-of-Expertsparameter-efficient fine-tuning	—Unverified
S2MoE: Robust Sparse Mixture of Experts via Stochastic Learning	Mar 29, 2025	Mixture-of-Experts	—Unverified
Sparse Mixture of Experts as Unified Competitive Learning	Mar 29, 2025	Language ModelingLanguage Modelling	—Unverified
Beyond Standard MoE: Mixture of Latent Experts for Resource-Efficient Language Models	Mar 29, 2025	Computational EfficiencyMixture-of-Experts	—Unverified
Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities	Mar 28, 2025	Mixture-of-ExpertsText Generation	—Unverified
RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts	Mar 27, 2025	Code RepairFeature Engineering	—Unverified
LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models	Mar 27, 2025	Mixture-of-Experts	—Unverified
iMedImage Technical Report	Mar 27, 2025	Anomaly DetectionDiagnostic	—Unverified
Reasoning Beyond Limits: Advances and Open Problems for LLMs	Mar 26, 2025	Mixture-of-ExpertsRAG	—Unverified
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation	Mar 26, 2025	Knowledge DistillationMixture-of-Experts	—Unverified
Modality-Independent Brain Lesion Segmentation with Privacy-aware Continual Learning	Mar 26, 2025	Continual LearningKnowledge Distillation	CodeCode Available
Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework	Mar 26, 2025	Computational EfficiencyMixture-of-Experts	—Unverified
Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning	Mar 26, 2025	Mixture-of-Expertsparameter-efficient fine-tuning	—Unverified
A multi-scale lithium-ion battery capacity prediction using mixture of experts and patch-based MLP	Mar 26, 2025	Mixture-of-Experts	CodeCode Available
M^2CD: A Unified MultiModal Framework for Optical-SAR Change Detection with Mixture of Experts and Self-Distillation	Mar 25, 2025	Change DetectionDisaster Response	—Unverified
Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion	Mar 25, 2025	Autonomous DrivingMixture-of-Experts	—Unverified

Show:10 25 50

← PrevPage 10 of 27Next →

No leaderboard results yet.