SOTAVerified

Mixture-of-Experts

Papers

Showing 851900 of 1312 papers

TitleStatusHype
Variational Distillation of Diffusion Policies into Mixture of Experts0
GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace TheoryCode0
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language UnderstandingCode0
Graph Knowledge Distillation to Mixture of ExpertsCode0
Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction0
Continual Traffic Forecasting via Mixture of Experts0
Filtered not Mixed: Stochastic Filtering-Based Online Gating for Mixture of Large Language Models0
Style Mixture of Experts for Expressive Text-To-Speech Synthesis0
Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach0
A Gaussian Process-based Streaming Algorithm for Prediction of Time Series With Regimes and OutliersCode0
Optimizing 6G Integrated Sensing and Communications (ISAC) via Expert Networks0
Training-efficient density quantum machine learning0
Learning Mixture-of-Experts for General-Purpose Black-Box Discrete OptimizationCode0
MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors0
MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models0
LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design0
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts0
Expert-Token Resonance: Redefining MoE Routing through Affinity-Driven Active Selection0
Statistical Advantages of Perturbing Cosine Router in Mixture of Experts0
Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts0
Ensemble and Mixture-of-Experts DeepONets For Operator LearningCode0
Learning More Generalized Experts by Merging Experts in Mixture-of-Experts0
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts0
A Mixture of Experts Approach to 3D Human Motion PredictionCode0
A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds0
SUTRA: Scalable Multilingual Language Model Architecture0
Lory: Fully Differentiable Mixture-of-Experts for Autoregressive Language Model Pre-training0
MEET: Mixture of Experts Extra Tree-Based sEMG Hand Gesture Identification0
WDMoE: Wireless Distributed Large Language Models with Mixture of Experts0
Mixture of partially linear experts0
Hierarchical mixture of discriminative Generalized Dirichlet classifiers0
Mixture of insighTful Experts (MoTE): The Synergy of Thought Chains and Expert Mixtures in Self-Alignment0
Powering In-Database Dynamic Model Slicing for Structured Data Analytics0
MoPEFT: A Mixture-of-PEFTs for the Segment Anything Model0
Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping0
Mix of Experts Language Model for Named Entity Recognition0
Towards Incremental Learning in Large Language Models: A Critical Review0
Integration of Mixture of Experts and Multimodal Generative AI in Internet of Vehicles: A Survey0
U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF0
A Novel A.I Enhanced Reservoir Characterization with a Combined Mixture of Experts -- NVIDIA Modulus based Physics Informed Neural Operator Forward Model0
A Large-scale Medical Visual Task Adaptation Benchmark0
MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation0
Generative AI Agents with Large Language Model for Satellite Networks via a Mixture of Experts Transmission0
Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning0
Mixture of Experts Soften the Curse of Dimensionality in Operator Learning0
Countering Mainstream Bias via End-to-End Adaptive Local LearningCode0
Identifying Shopping Intent in Product QA for Proactive Recommendations0
Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models0
SEER-MoE: Sparse Expert Efficiency through Regularization for Mixture-of-Experts0
Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts0
Show:102550
← PrevPage 18 of 27Next →

No leaderboard results yet.