SOTAVerified

Mixture-of-Experts

Papers

Showing 376400 of 1312 papers

TitleStatusHype
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection0
AskChart: Universal Chart Understanding through Textual EnhancementCode0
BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-SpoofingCode0
UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition0
BrainMAP: Learning Multiple Activation Pathways in Brain NetworksCode1
Part-Of-Speech Sensitivity of Routers in Mixture of Experts Models0
Theory of Mixture-of-Experts for Mobile Edge Computing0
Qwen2.5 Technical ReportCode13
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU RoutingCode2
A Survey on Inference Optimization Techniques for Mixture of Experts ModelsCode3
MedCoT: Medical Chain of Thought via Hierarchical ExpertCode1
SEKE: Specialised Experts for Keyword ExtractionCode0
SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control TasksCode0
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE InferenceCode0
Enhancing Healthcare Recommendation Systems with a Multimodal LLMs-based MOE Architecture0
Investigating Mixture of Experts in Dense Retrieval0
Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic SegmentationCode0
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model ArchitectureCode1
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
Llama 3 Meets MoE: Efficient Upcycling0
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal UnderstandingCode9
Towards a Multimodal Large Language Model with Pixel-Level Insight for BiomedicineCode2
Mixture of Experts Meets Decoupled Message Passing: Towards General and Adaptive Node ClassificationCode0
Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective0
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems0
Show:102550
← PrevPage 16 of 53Next →

No leaderboard results yet.