| HydraSum: Disentangling Stylistic Features in Text Summarization using Multi-Decoder Models | Oct 8, 2021 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 |
| Sparse MoEs meet Efficient Ensembles | Oct 7, 2021 | Few-Shot LearningMixture-of-Experts | CodeCode Available | 1 |
| Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Sep 9, 2021 | Mixture-of-ExpertsRetrieval | CodeCode Available | 1 |
| Few-Shot and Continual Learning with Attentive Independent Mechanisms | Jul 29, 2021 | Continual LearningFew-Shot Learning | CodeCode Available | 1 |
| Go Wider Instead of Deeper | Jul 25, 2021 | Image ClassificationMixture-of-Experts | CodeCode Available | 1 |
| Heterogeneous Multi-task Learning with Expert Diversity | Jun 20, 2021 | DiversityMixture-of-Experts | CodeCode Available | 1 |
| Scaling Vision with Sparse Mixture of Experts | Jun 10, 2021 | Few-Shot Image ClassificationImage Classification | CodeCode Available | 1 |
| RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | May 14, 2021 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 |
| SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts | May 7, 2021 | DiversityMixture-of-Experts | CodeCode Available | 1 |
| MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering | May 5, 2021 | ClusteringContrastive Learning | CodeCode Available | 1 |
| Cross-Domain Label-Adaptive Stance Detection | Apr 15, 2021 | Domain AdaptationMixture-of-Experts | CodeCode Available | 1 |
| VDSM: Unsupervised Video Disentanglement with State-Space Modeling and Deep Mixtures of Experts | Mar 12, 2021 | DecoderDisentanglement | CodeCode Available | 1 |
| Real-time Relevant Recommendation Suggestion | Mar 8, 2021 | Mixture-of-ExpertsRecommendation Systems | CodeCode Available | 1 |
| Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts | Jan 18, 2021 | AllMixture-of-Experts | CodeCode Available | 1 |
| PFL-MoE: Personalized Federated Learning Based on Mixture of Experts | Dec 31, 2020 | Decision MakingFederated Learning | CodeCode Available | 1 |
| Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks | Nov 26, 2020 | Depth EstimationMixture-of-Experts | CodeCode Available | 1 |
| Specialized federated learning using a mixture of experts | Oct 5, 2020 | Federated LearningMixture-of-Experts | CodeCode Available | 1 |
| Transformer Based Multi-Source Domain Adaptation | Sep 16, 2020 | Domain AdaptationMixture-of-Experts | CodeCode Available | 1 |
| Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction | Aug 26, 2020 | Interpretable Machine LearningMixture-of-Experts | CodeCode Available | 1 |
| Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes | Jun 19, 2020 | Continual LearningDecision Making | CodeCode Available | 1 |
| Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts | Feb 10, 2020 | Language ModellingMixture-of-Experts | CodeCode Available | 1 |
| Variational Mixture-of-Experts Autoencoders for Multi-Modal Deep Generative Models | Nov 8, 2019 | Mixture-of-Experts | CodeCode Available | 1 |
| MoËT: Mixture of Expert Trees and its Application to Verifiable Reinforcement Learning | Jun 16, 2019 | Game of GoImitation Learning | CodeCode Available | 1 |
| Gated Multimodal Units for Information Fusion | Feb 7, 2017 | General ClassificationGenre classification | CodeCode Available | 1 |
| Distilling the Knowledge in a Neural Network | Mar 9, 2015 | Knowledge DistillationMixture-of-Experts | CodeCode Available | 1 |