| Combinations of Adaptive Filters | Dec 22, 2021 | Mixture-of-Experts | —Unverified | 0 |
| Efficient Large Scale Language Modeling with Mixtures of Experts | Dec 20, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification | Dec 16, 2021 | Generalizable Person Re-identificationMixture-of-Experts | CodeCode Available | 1 |
| GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | Dec 13, 2021 | Common Sense ReasoningIn-Context Learning | —Unverified | 0 |
| Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition | Dec 10, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Specializing Versatile Skill Libraries using Local Mixture of Experts | Dec 8, 2021 | Incremental LearningMixture-of-Experts | CodeCode Available | 0 |
| Anchoring to Exemplars for Training Mixture-of-Expert Cell Embeddings | Dec 6, 2021 | Drug DiscoveryGPU | —Unverified | 0 |
| A Mixture of Expert Based Deep Neural Network for Improved ASR | Dec 2, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TAL: Two-stream Adaptive Learning for Generalizable Person Re-identification | Nov 29, 2021 | Domain GeneralizationGeneralizable Person Re-identification | —Unverified | 0 |
| Expert Aggregation for Financial Forecasting | Nov 25, 2021 | BIG-bench Machine LearningMixture-of-Experts | —Unverified | 0 |
| SpeechMoE2: Mixture-of-Experts Model with Improved Routing | Nov 23, 2021 | Computational EfficiencyMixture-of-Experts | —Unverified | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | Nov 16, 2021 | Mixture-of-Experts | —Unverified | 0 |
| StableMoE: Stable Routing Strategy for Mixture of Experts | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Table-based Fact Verification with Self-adaptive Mixture of Experts | Nov 16, 2021 | Fact VerificationLogical Reasoning | —Unverified | 0 |
| SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization | Nov 16, 2021 | Abstractive Text SummarizationMixture-of-Experts | —Unverified | 0 |
| MoEfication: Conditional Computation of Transformer Models for Efficient Inference | Nov 16, 2021 | Mixture-of-Experts | —Unverified | 0 |
| Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern Estimation | Nov 2, 2021 | Mixture-of-Experts | CodeCode Available | 0 |
| RTM Super Learner Results at Quality Estimation Task | Nov 1, 2021 | Mixture-of-ExpertsTranslation | —Unverified | 0 |
| Unsupervised Foreground Extraction via Deep Region Competition | Oct 29, 2021 | Image SegmentationInductive Bias | CodeCode Available | 1 |
| Polynomial-Spline Neural Networks with Exact Integrals | Oct 26, 2021 | Mixture-of-Expertsregression | —Unverified | 0 |
| P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts | Oct 14, 2021 | Mixture-of-ExpertsNatural Language Queries | —Unverified | 0 |
| Simple or Complex? Complexity-Controllable Question Generation with Soft Templates and Deep Mixture of Experts Model | Oct 13, 2021 | Mixture-of-ExpertsQuestion Generation | —Unverified | 0 |
| HydraSum: Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Oct 8, 2021 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 |
| Taming Sparsely Activated Transformer with Stochastic Experts | Oct 8, 2021 | Machine TranslationMixture-of-Experts | CodeCode Available | 1 |
| Sparse MoEs meet Efficient Ensembles | Oct 7, 2021 | Few-Shot LearningMixture-of-Experts | CodeCode Available | 1 |
| Continual Learning Using Task Conditional Neural Networks | Sep 29, 2021 | Continual LearningMixture-of-Experts | —Unverified | 0 |
| Full-Precision Free Binary Graph Neural Networks | Sep 29, 2021 | Graph Neural NetworkMixture-of-Experts | —Unverified | 0 |
| MECATS: Mixture-of-Experts for Probabilistic Forecasts of Aggregated Time Series | Sep 29, 2021 | Mixture-of-ExpertsTime Series | —Unverified | 0 |
| HydraSum - Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Sep 29, 2021 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference | Sep 24, 2021 | Mixture-of-ExpertsSentence | —Unverified | 0 |
| Unbiased Gradient Estimation with Balanced Assignments for Mixtures of Experts | Sep 24, 2021 | Mixture-of-Experts | —Unverified | 0 |
| Scalable and Efficient MoE Training for Multitask Multilingual Models | Sep 22, 2021 | Machine TranslationMixture-of-Experts | —Unverified | 0 |
| Universal Simultaneous Machine Translation with Mixture-of-Experts Wait-k Policy | Sep 11, 2021 | Machine TranslationMixture-of-Experts | CodeCode Available | 0 |
| Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Sep 9, 2021 | Mixture-of-ExpertsRetrieval | CodeCode Available | 1 |
| Cross-token Modeling with Conditional Computation | Sep 5, 2021 | Computational EfficiencyImage Classification | —Unverified | 0 |
| Personalised Federated Learning: A Combinational Approach | Aug 22, 2021 | Federated LearningKnowledge Distillation | —Unverified | 0 |
| SPMoE: Generate Multiple Pattern-Aware Outputs with Sparse Pattern Mixture of Experts | Aug 17, 2021 | DiversityMixture-of-Experts | —Unverified | 0 |
| AIREX: Neural Network-based Approach for Air Quality Inference in Unmonitored Cities | Aug 16, 2021 | Air Quality InferenceMixture-of-Experts | —Unverified | 0 |
| Strength in Numbers: Averaging and Clustering Effects in Mixture of Experts for Graph-Based Dependency Parsing | Aug 1, 2021 | ClusteringDependency Parsing | —Unverified | 0 |
| A Mixture-of-Experts Model for Antonym-Synonym Discrimination | Aug 1, 2021 | Mixture-of-Experts | CodeCode Available | 0 |
| ExpertRank: A Multi-level Coarse-grained Expert-based Listwise Ranking Loss | Jul 29, 2021 | Information RetrievalMixture-of-Experts | —Unverified | 0 |
| Few-Shot and Continual Learning with Attentive Independent Mechanisms | Jul 29, 2021 | Continual LearningFew-Shot Learning | CodeCode Available | 1 |
| Go Wider Instead of Deeper | Jul 25, 2021 | Image ClassificationMixture-of-Experts | CodeCode Available | 1 |
| Federated Mixture of Experts | Jul 14, 2021 | Federated LearningMixture-of-Experts | —Unverified | 0 |
| Lifelong Mixture of Variational Autoencoders | Jul 9, 2021 | Lifelong learningMixture-of-Experts | CodeCode Available | 0 |
| AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style | Jul 6, 2021 | DecoderMixture-of-Experts | —Unverified | 0 |
| Adaptive 3D descattering with a dynamic synthesis network | Jul 1, 2021 | DenoisingMixture-of-Experts | CodeCode Available | 0 |
| On component interactions in two-stage recommender systems | Jun 28, 2021 | Mixture-of-ExpertsRecommendation Systems | —Unverified | 0 |
| Mixtures of Deep Neural Experts for Automated Speech Scoring | Jun 23, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Heterogeneous Multi-task Learning with Expert Diversity | Jun 20, 2021 | DiversityMixture-of-Experts | CodeCode Available | 1 |