SOTAVerified

Mixture-of-Experts

Papers

Showing 551-575 of 1312 papers

| Title | Status | Hype |
|---|---|---|
| PROPER: A Progressive Learning Framework for Personalized Large Language Models with Group-Level Adaptation | | 0 |
| Unify and Anchor: A Context-Aware Transformer for Cross-Domain Time Series Forecasting | | 0 |
| DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models | | 0 |
| Explainable Classifier for Malignant Lymphoma Subtyping via Cell Graph and Image Fusion | | 0 |
| CL-MoE: Enhancing Multimodal Large Language Model with Dual Momentum Mixture-of-Experts for Continual Visual Question Answering | | 0 |
| CoSMoEs: Compact Sparse Mixture of Experts | | 0 |
| UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook | | 0 |
| Mixture of Experts for Recognizing Depression from Interview and Reading Tasks | | 0 |
| Mixture of Experts-augmented Deep Unfolding for Activity Detection in IRS-aided Systems | | 0 |
| OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment | | 0 |
| Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization | | 0 |
| Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks | | 0 |
| ENACT-Heart -- ENsemble-based Assessment Using CNN and Transformer on Heart Sounds | | 0 |
| The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE | | 0 |
| BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference | | 0 |
| An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning | | 0 |
| Tight Clusters Make Specialized Experts | Code | 0 |
| Binary-Integer-Programming Based Algorithm for Expert Load Balancing in Mixture-of-Experts Models | Code | 0 |
| Ray-Tracing for Conditionally Activated Neural Networks | | 0 |
| Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts | | 0 |
| Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models | | 0 |
| DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs | | 0 |
| How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines | | 0 |
| Connector-S: A Survey of Connectors in Multi-modal Large Language Models | | 0 |
| Fate: Fast Edge Inference of Mixture-of-Experts Models via Cross-Layer Gate | Code | 0 |
Page 23 of 53

No leaderboard results yet.