SOTAVerified

Mixture-of-Experts

Papers

Showing 1051–1100 of 1312 papers

Title | Status | Hype
Quantitative Stock Investment by Routing Uncertainty-Aware Trading Experts: A Multi-Task Learning Approach | - | 0
Tutel: Adaptive Mixture-of-Experts at Scale | Code | 2
Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts | - | 0
Interpretable Mixture of Experts | - | 0
Patcher: Patch Transformers with Mixture of Experts for Precise Medical Image Segmentation | Code | 1
Task-Specific Expert Pruning for Sparse Mixture-of-Experts | - | 0
Text2Human: Text-Driven Controllable Human Image Generation | Code | 2
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers | - | 0
Automatic Expert Selection for Multi-Scenario and Multi-Task Search | - | 0
Eliciting and Understanding Cross-Task Skills with Task-Level Mixture-of-Experts | Code | 0
Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT | - | 0
MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services | - | 0
Pluralistic Image Completion with Probabilistic Mixture-of-Experts | - | 0
Unified Modeling of Multi-Domain Multi-Device ASR Systems | - | 0
Addressing Confounding Feature Issue for Causal Recommendation | Code | 1
ST-ExpertNet: A Deep Expert Framework for Traffic Prediction | - | 0
Optimizing Mixture of Experts using Dynamic Recompilations | - | 0
How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? | - | 0
On the Representation Collapse of Sparse Mixture of Experts | - | 0
Residual Mixture of Experts | - | 0
Table-based Fact Verification with Self-adaptive Mixture of Experts | Code | 0
Towards Efficient Single Image Dehazing and Desnowing | - | 0
StableMoE: Stable Routing Strategy for Mixture of Experts | Code | 1
Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | - | 0
Mixture of Experts for Biomedical Question Answering | - | 0
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation | Code | 1
Mixture-of-experts VAEs can disregard variation in surjective multimodal data | - | 0
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition | Code | 1
Learning to Adapt Clinical Sequences with Residual Mixture of Experts | Code | 0
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation | - | 0
On the Adaptation to Concept Drift for CTR Prediction | - | 0
Efficient Reflectance Capture with a Deep Gated Mixture-of-Experts | - | 0
Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution | Code | 1
Build a Robust QA System with Transformer-based Mixture of Experts | Code | 0
Efficient Language Modeling with Sparse all-MLP | - | 0
SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | Code | 1
SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding | - | 0
Parameter-Efficient Mixture-of-Experts Architecture for Pre-trained Language Models | Code | 1
Functional mixture-of-experts for classification | - | 0
Mixture-of-Experts with Expert Choice Routing | - | 0
ST-MoE: Designing Stable and Transferable Sparse Expert Models | Code | 3
A Survey on Dynamic Neural Networks for Natural Language Processing | - | 0
Physics-Guided Problem Decomposition for Scaling Deep Learning of High-dimensional Eigen-Solvers: The Case of Schrödinger's Equation | - | 0
One Student Knows All Experts Know: From Sparse to Dense | - | 0
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation | - | 0
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale | Code | 0
Towards Lightweight Neural Animation : Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models | - | 0
MDFEND: Multi-domain Fake News Detection | Code | 2
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Code | 1
Page 22 of 27

No leaderboard results yet.