SOTAVerified

Mixture-of-Experts

Papers

Showing 1176–1200 of 1312 papers

| Title | Status | Hype |
| --- | --- | --- |
| Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Code | 0 |
| Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective | Code | 0 |
| Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Code | 0 |
| Discontinuity-Sensitive Optimal Control Learning by Mixture of Experts | Code | 0 |
| H^3Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs | Code | 0 |
| A Survey on Prompt Tuning | Code | 0 |
| On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists | Code | 0 |
| GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory | Code | 0 |
| AskChart: Universal Chart Understanding through Textual Enhancement | Code | 0 |
| GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors | Code | 0 |
| Guiding the Experts: Semantic Priors for Efficient and Focused MoE Routing | Code | 0 |
| CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts | Code | 0 |
| Online Action Recognition for Human Risk Prediction with Anticipated Haptic Alert via Wearables | Code | 0 |
| Table-based Fact Verification with Self-adaptive Mixture of Experts | Code | 0 |
| VE: Modeling Multivariate Time Series Correlation with Variate Embedding | Code | 0 |
| GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding | Code | 0 |
| Deep Mixture of Experts via Shallow Embedding | Code | 0 |
| Build a Robust QA System with Transformer-based Mixture of Experts | Code | 0 |
| TAMER: A Test-Time Adaptive MoE-Driven Framework for EHR Representation Learning | Code | 0 |
| DESIRE-ME: Domain-Enhanced Supervised Information REtrieval using Mixture-of-Experts | Code | 0 |
| DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale | Code | 0 |
| SEKE: Specialised Experts for Keyword Extraction | Code | 0 |
| Mixture of Link Predictors on Graphs | Code | 0 |
| Mixture-of-Experts Variational Autoencoder for Clustering and Generating from Similarity-Based Representations on Single Cell Data | Code | 0 |
| Opponent Modeling in Deep Reinforcement Learning | Code | 0 |
Page 48 of 53

No leaderboard results yet.