SOTAVerified

Mixture-of-Experts

Papers

Showing 901–950 of 1312 papers

| Title | Status | Hype |
| --- | --- | --- |
| SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing | | 0 |
| Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning | | 0 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | | 0 |
| Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | | 0 |
| Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT | | 0 |
| Sparse Mixture of Experts as Unified Competitive Learning | | 0 |
| Sparse Mixture-of-Experts for Non-Uniform Noise Reduction in MRI Images | | 0 |
| Cross-token Modeling with Conditional Computation | | 0 |
| Sparse Upcycling: Inference Inefficient Finetuning | | 0 |
| Sparse Video Representation Using Steered Mixture-of-Experts With Global Motion Compensation | | 0 |
| Sparsity-Constrained Optimal Transport | | 0 |
| Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling | | 0 |
| SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations | | 0 |
| SpeechMoE2: Mixture-of-Experts Model with Improved Routing | | 0 |
| Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | | 0 |
| SPMoE: Generate Multiple Pattern-Aware Outputs with Sparse Pattern Mixture of Experts | | 0 |
| SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging | | 0 |
| StableMoE: Stable Routing Strategy for Mixture of Experts | | 0 |
| STAR-Rec: Making Peace with Length Variance and Pattern Diversity in Sequential Recommendation | | 0 |
| Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference | | 0 |
| Statistical Advantages of Perturbing Cosine Router in Mixture of Experts | | 0 |
| Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts | | 0 |
| Stealing User Prompts from Mixture of Experts | | 0 |
| Steered Mixture-of-Experts Autoencoder Design for Real-Time Image Modelling and Denoising | | 0 |
| Steered Mixture of Experts Regression for Image Denoising with Multi-Model-Inference | | 0 |
| Steps are all you need: Rethinking STEM Education with Prompt Engineering | | 0 |
| Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts | | 0 |
| ST-ExpertNet: A Deep Expert Framework for Traffic Prediction | | 0 |
| SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios | | 0 |
| StPR: Spatiotemporal Preservation and Routing for Exemplar-Free Video Class-Incremental Learning | | 0 |
| Strength in Numbers: Averaging and Clustering Effects in Mixture of Experts for Graph-Based Dependency Parsing | | 0 |
| Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs | | 0 |
| STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning | | 0 |
| Style Mixture of Experts for Expressive Text-To-Speech Synthesis | | 0 |
| Stylistic Variation in Social Media Part-of-Speech Tagging | | 0 |
| SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | | 0 |
| SUTRA: Scalable Multilingual Language Model Architecture | | 0 |
| Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | | 0 |
| Tabby: Tabular Data Synthesis with Language Models | | 0 |
| Table-based Fact Verification with Self-adaptive Mixture of Experts | | 0 |
| Table-based Fact Verification with Self-labeled Keypoint Alignment | | 0 |
| TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification | | 0 |
| Task-Based MoE for Multitask Multilingual Machine Translation | | 0 |
| Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts | | 0 |
| Task-Specific Expert Pruning for Sparse Mixture-of-Experts | | 0 |
| Team Deep Mixture of Experts for Distributed Power Control | | 0 |
| Terminating Differentiable Tree Experts | | 0 |
| The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE | | 0 |
| The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs | | 0 |
| Theory of Mixture-of-Experts for Mobile Edge Computing | | 0 |
Page 19 of 27
