SOTAVerified

Mixture-of-Experts

Papers

Showing 351400 of 1312 papers

TitleStatusHype
OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning0
LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading0
MiniMax-01: Scaling Foundation Models with Lightning AttentionCode7
PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration0
GRAPHMOE: Amplifying Cognitive Depth of Mixture-of-Experts Network via Introducing Self-Rethinking Mechanism0
A Multi-Modal Deep Learning Framework for Pan-Cancer PrognosisCode0
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous LearningCode1
TAMER: A Test-Time Adaptive MoE-Driven Framework for EHR Representation LearningCode0
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing0
mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training0
LiMoE: Mixture of LiDAR Representation Learners from Automotive ScenesCode2
Mixture-of-Experts Graph Transformers for Interpretable Particle Collision DetectionCode0
Fresh-CL: Feature Realignment through Experts on Hypersphere in Continual Learning0
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders0
Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images0
UNIALIGN: Scaling Multimodal Alignment within One Unified Model0
Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning0
Towards Efficient Foundation Model for Zero-shot Amodal Segmentation0
MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification0
REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization0
CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection0
Superposition in Transformers: A Novel Way of Building Mixture of ExpertsCode2
Multimodal Variational Autoencoder: a Barycentric View0
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper GranularityCode0
DeepSeek-V3 Technical ReportCode16
Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection0
AskChart: Universal Chart Understanding through Textual EnhancementCode0
BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-SpoofingCode0
UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition0
BrainMAP: Learning Multiple Activation Pathways in Brain NetworksCode1
Part-Of-Speech Sensitivity of Routers in Mixture of Experts Models0
Theory of Mixture-of-Experts for Mobile Edge Computing0
Qwen2.5 Technical ReportCode13
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU RoutingCode2
A Survey on Inference Optimization Techniques for Mixture of Experts ModelsCode3
MedCoT: Medical Chain of Thought via Hierarchical ExpertCode1
SEKE: Specialised Experts for Keyword ExtractionCode0
SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control TasksCode0
DAOP: Data-Aware Offloading and Predictive Pre-Calculation for Efficient MoE InferenceCode0
Enhancing Healthcare Recommendation Systems with a Multimodal LLMs-based MOE Architecture0
Investigating Mixture of Experts in Dense Retrieval0
Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic SegmentationCode0
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model ArchitectureCode1
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
Llama 3 Meets MoE: Efficient UpcyclingCode0
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal UnderstandingCode9
Towards a Multimodal Large Language Model with Pixel-Level Insight for BiomedicineCode2
Mixture of Experts Meets Decoupled Message Passing: Towards General and Adaptive Node ClassificationCode0
Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective0
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems0
Show:102550
← PrevPage 8 of 27Next →

No leaderboard results yet.