SOTAVerified

Mixture-of-Experts

Papers

Showing 10511075 of 1312 papers

TitleStatusHype
Omni-SMoLA: Boosting Generalist Multimodal Models with Soft Mixture of Low-rank Experts0
OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning0
On component interactions in two-stage recommender systems0
SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding0
OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment0
One Student Knows All Experts Know: From Sparse to Dense0
On Expert Estimation in Hierarchical Mixture of Experts: Beyond Softmax Gating Functions0
On Least Square Estimation in Softmax Gating Mixture of Experts0
On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts0
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning0
On Parameter Estimation in Deviated Gaussian Mixture of Experts0
On the Adversarial Robustness of Mixture of Experts0
On the effectiveness of discrete representations in sparse mixture of experts0
On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks0
On the Functional Equivalence of TSK Fuzzy Systems to Neural Networks, Mixture of Experts, CART, and Stacking Ensemble Regression0
On the Representation Collapse of Sparse Mixture of Experts0
On the Risk of Evidence Pollution for Malicious Social Text Detection in the Era of LLMs0
Opportunistic Osteoporosis Diagnosis via Texture-Preserving Self-Supervision, Mixture of Experts and Multi-Task Integration0
Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework0
Optimizing 6G Integrated Sensing and Communications (ISAC) via Expert Networks0
Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing0
Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques0
Optimizing Mixture-of-Experts Inference Time Combining Model Deployment and Communication Scheduling0
Optimizing Mixture of Experts using Dynamic Recompilations0
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach0
Show:102550
← PrevPage 43 of 53Next →

No leaderboard results yet.