SOTAVerified

Mixture-of-Experts

Papers

Showing 251300 of 1312 papers

TitleStatusHype
Graph Sparsification via Mixture of GraphsCode1
PFL-MoE: Personalized Federated Learning Based on Mixture of ExpertsCode1
Prompt-prompted Adaptive Structured Pruning for Efficient LLM GenerationCode1
Question-Aware Gaussian Experts for Audio-Visual Question AnsweringCode1
Gradient-free variational learning with conditional mixture networksCode1
BiMediX: Bilingual Medical Mixture of Experts LLMCode1
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language ModelsCode1
Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image SegmentationCode1
Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training and InferenceCode1
GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable RecommendationCode1
Gated Multimodal Units for Information FusionCode1
GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned ExpertsCode1
Go Wider Instead of DeeperCode1
Frequency-Adaptive Pan-Sharpening with Mixture of ExpertsCode1
MLP Fusion: Towards Efficient Fine-tuning of Dense and Mixture-of-Experts Language ModelsCode1
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural NetworksCode1
MX-Font++: Mixture of Heterogeneous Aggregation Experts for Few-shot Font GenerationCode1
Few-Shot and Continual Learning with Attentive Independent MechanismsCode1
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-DesignCode1
Multi-Task Reinforcement Learning with Mixture of Orthogonal ExpertsCode1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward ModelingCode1
Multi-view Depth Estimation using Epipolar Spatio-Temporal NetworksCode1
Navigating Spatio-Temporal Heterogeneity: A Graph Transformer Approach for Traffic ForecastingCode1
Exploring Sparse MoE in GANs for Text-conditioned Image SynthesisCode1
Multilinear Mixture of Experts: Scalable Expert Specialization through FactorizationCode1
Multimodal Clinical Trial Outcome Prediction with Large Language ModelsCode1
Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model InferenceCode1
Multi-Head Mixture-of-ExpertsCode1
Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-ExpertsCode1
EWMoE: An effective model for global weather forecasting with mixture-of-expertsCode1
DMT-HI: MOE-based Hyperbolic Interpretable Deep Manifold Transformation for Unspervised Dimensionality ReductionCode1
Specialized federated learning using a mixture of expertsCode1
Examining Post-Training Quantization for Mixture-of-Experts: A BenchmarkCode1
FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language ModelsCode1
MomentumSMoE: Integrating Momentum into Sparse Mixture of ExpertsCode1
FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and EditingCode1
Distribution-aware Forgetting Compensation for Exemplar-Free Lifelong Person Re-identificationCode1
XMoE: Sparse Models with Fine-grained and Adaptive Expert SelectionCode1
Distilling the Knowledge in a Neural NetworkCode1
Emergent Modularity in Pre-trained TransformersCode1
Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of ExpertsCode1
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language UnderstandingCode1
MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing NetworksCode1
Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-ExpertsCode1
Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical RoutingCode1
MoËT: Mixture of Expert Trees and its Application to Verifiable Reinforcement LearningCode1
DirectMultiStep: Direct Route Generation for Multi-Step RetrosynthesisCode1
Enhancing Fast Feed Forward Networks with Load Balancing and a Master Leaf NodeCode1
MoExtend: Tuning New Experts for Modality and Task ExtensionCode1
Efficient Dictionary Learning with Switch Sparse AutoencodersCode1
Show:102550
← PrevPage 6 of 27Next →

No leaderboard results yet.