SOTAVerified

Mixture-of-Experts

Papers

Showing 10511100 of 1312 papers

TitleStatusHype
Generalizing Multimodal Variational Methods to Sets0
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners0
Fixing MoE Over-Fitting on Low-Resource Languages in Multilingual Machine Translation0
SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing0
Incorporating Polar Field Data for Improved Solar Flare Prediction0
Named Entity and Relation Extraction with Multi-Modal Retrieval0
Automatically Extracting Information in Medical Dialogue: Expert System And Attention for Labelling0
Double Deep Q-Learning in Opponent Modeling0
Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production0
A Bird's-eye View of Reranking: from List Level to Page LevelCode0
HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization0
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts0
SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations0
Using Deep Mixture-of-Experts to Detect Word Meaning Shift for TempoWiC0
Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts0
Contextual Mixture of Experts: Integrating Knowledge into Predictive Modeling0
Prediction Sets for High-Dimensional Mixture of Experts Models0
Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models0
Coordination with Humans via Strategy Matching0
On the Adversarial Robustness of Mixture of Experts0
Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters0
FEAMOE: Fair, Explainable and Adaptive Mixture of Experts0
Deep Learning Mixture-of-Experts Approach for Cytotoxic Edema Assessment in Infants and Children0
Probabilistic partition of unity networks for high-dimensional regression problems0
Parameter-varying neural ordinary differential equations with partition-of-unity networks0
Table-based Fact Verification with Self-labeled Keypoint Alignment0
Sparsity-Constrained Optimal Transport0
Mixture of experts models for multilevel data: modelling framework and approximation theory0
Tuning of Mixture-of-Experts Mixed-Precision Neural Networks0
Diversified Dynamic Routing for Vision Tasks0
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition0
Sparse Video Representation Using Steered Mixture-of-Experts With Global Motion Compensation0
A Review of Sparse Expert Models in Deep Learning0
ADMoE: Anomaly Detection with Mixture-of-Experts from Noisy Labels0
Context-aware Mixture-of-Experts for Unbiased Scene Graph Generation0
A Theoretical View on Sparsely Activated Networks0
Edge-Aware Autoencoder Design for Real-Time Mixture-of-Experts Image Compression0
Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing0
MoEC: Mixture of Expert Clusters0
Learning Large-scale Universal User Representation with Sparse Mixture of Experts0
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video RetrievalCode0
Scalable Neural Data Server: A Data Recommender for Transfer Learning0
Adaptive Expert Models for Personalization in Federated LearningCode0
Quantitative Stock Investment by Routing Uncertainty-Aware Trading Experts: A Multi-Task Learning Approach0
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts0
Interpretable Mixture of Experts0
Task-Specific Expert Pruning for Sparse Mixture-of-Experts0
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers0
Automatic Expert Selection for Multi-Scenario and Multi-Task Search0
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-ExpertsCode0
Show:102550
← PrevPage 22 of 27Next →

No leaderboard results yet.