SOTAVerified

Mixture-of-Experts

Papers

Showing 951960 of 1312 papers

TitleStatusHype
Towards Convergence Rates for Parameter Estimation in Gaussian-gated Mixture of Experts0
Locking and Quacking: Stacking Bayesian model predictions by log-pooling and superposition0
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception0
Steered Mixture-of-Experts Autoencoder Design for Real-Time Image Modelling and Denoising0
Demystifying Softmax Gating Function in Gaussian Mixture of Experts0
Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic CapacityCode0
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data IntegrationCode1
Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism0
Revisiting Single-gated Mixtures of Experts0
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement0
Show:102550
← PrevPage 96 of 132Next →

No leaderboard results yet.