SOTAVerified

Mixture-of-Experts

Papers

Showing 1101-1150 of 1312 papers

Title | Status | Hype
Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT | - | 0
MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services | - | 0
Pluralistic Image Completion with Probabilistic Mixture-of-Experts | - | 0
Unified Modeling of Multi-Domain Multi-Device ASR Systems | - | 0
ST-ExpertNet: A Deep Expert Framework for Traffic Prediction | - | 0
Optimizing Mixture of Experts using Dynamic Recompilations | - | 0
How Can Cross-lingual Knowledge Contribute Better to Fine-Grained Entity Typing? | - | 0
On the Representation Collapse of Sparse Mixture of Experts | - | 0
Residual Mixture of Experts | - | 0
Towards Efficient Single Image Dehazing and Desnowing | - | 0
Table-based Fact Verification with Self-adaptive Mixture of Experts | Code | 0
Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | - | 0
Mixture of Experts for Biomedical Question Answering | - | 0
Mixture-of-experts VAEs can disregard variation in surjective multimodal data | - | 0
Learning to Adapt Clinical Sequences with Residual Mixture of Experts | Code | 0
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation | - | 0
On the Adaptation to Concept Drift for CTR Prediction | - | 0
Efficient Reflectance Capture with a Deep Gated Mixture-of-Experts | - | 0
Build a Robust QA System with Transformer-based Mixture of Experts | Code | 0
Efficient Language Modeling with Sparse all-MLP | - | 0
SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding | - | 0
Functional mixture-of-experts for classification | - | 0
Mixture-of-Experts with Expert Choice Routing | - | 0
A Survey on Dynamic Neural Networks for Natural Language Processing | - | 0
Physics-Guided Problem Decomposition for Scaling Deep Learning of High-dimensional Eigen-Solvers: The Case of Schrödinger's Equation | - | 0
One Student Knows All Experts Know: From Sparse to Dense | - | 0
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation | - | 0
Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners | - | 0
DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale | Code | 0
Towards Lightweight Neural Animation : Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models | - | 0
Combinations of Adaptive Filters | - | 0
Efficient Large Scale Language Modeling with Mixtures of Experts | - | 0
GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | - | 0
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition | - | 0
Specializing Versatile Skill Libraries using Local Mixture of Experts | Code | 0
Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings | - | 0
A Mixture of Expert Based Deep Neural Network for Improved ASR | - | 0
TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification | - | 0
Expert Aggregation for Financial Forecasting | - | 0
SpeechMoE2: Mixture-of-Experts Model with Improved Routing | - | 0
Table-based Fact Verification with Self-adaptive Mixture of Experts | - | 0
MoEfication: Conditional Computation of Transformer Models for Efficient Inference | - | 0
StableMoE: Stable Routing Strategy for Mixture of Experts | - | 0
M6-T: Exploring Sparse Expert Models and Beyond | - | 0
SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | - | 0
Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern Estimation | Code | 0
RTM Super Learner Results at Quality Estimation Task | - | 0
Polynomial-Spline Neural Networks with Exact Integrals | - | 0
P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts | - | 0
Simple or Complex? Complexity-Controllable Question Generation with Soft Templates and Deep Mixture of Experts Model | - | 0
Page 23 of 27

No leaderboard results yet.