| Transformer Based Multi-Source Domain Adaptation | Sep 16, 2020 | Domain Adaptation, Mixture-of-Experts | Code Available | 1 |
| Double-Wing Mixture of Experts for Streaming Recommendations | Sep 14, 2020 | Ensemble Learning, Mixture-of-Experts | Unverified | 0 |
| Anomaly Detection by Recombining Gated Unsupervised Experts | Aug 31, 2020 | Anomaly Detection, Mixture-of-Experts | Code Available | 0 |
| Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction | Aug 26, 2020 | Interpretable Machine Learning, Mixture-of-Experts | Code Available | 1 |
| Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts | Aug 25, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Biased Mixtures Of Experts: Enabling Computer Vision Inference Under Data Transfer Limitations | Aug 21, 2020 | Action Classification, Image Super-Resolution | Unverified | 0 |
| MIXCAPS: A Capsule Network-based Mixture of Experts for Lung Nodule Malignancy Prediction | Aug 13, 2020 | Mixture-of-Experts, Specificity | Unverified | 0 |
| Team Deep Mixture of Experts for Distributed Power Control | Jul 28, 2020 | Mixture-of-Experts, speech-recognition | Unverified | 0 |
| Adversarial Mixture Of Experts with Category Hierarchy Soft Constraint | Jul 24, 2020 | Clustering, Feature Importance | Code Available | 0 |
| Exploring Model Consensus to Generate Translation Paraphrases | Jul 1, 2020 | Diversity, Machine Translation | Code Available | 0 |
| A Mixture of h - 1 Heads is Better than h Heads | Jul 1, 2020 | Language Modeling, Language Modelling | Unverified | 0 |
| GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding | Jun 30, 2020 | Machine Translation, Mixture-of-Experts | Code Available | 0 |
| Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes | Jun 19, 2020 | Continual Learning, Decision Making | Code Available | 1 |
| Model Agnostic Combination for Ensemble Learning | Jun 16, 2020 | Ensemble Learning, Mixture-of-Experts | Unverified | 0 |
| An efficient application of Bayesian optimization to an industrial MDO framework for aircraft design | Jun 12, 2020 | Bayesian Optimization, global-optimization | Unverified | 0 |
| Fast Deep Mixtures of Gaussian Process Experts | Jun 11, 2020 | Gaussian Processes, Mixture-of-Experts | Unverified | 0 |
| Catching Attention with Automatic Pull Quote Selection | May 27, 2020 | Articles, Mixture-of-Experts | Code Available | 0 |
| A Tree Architecture of LSTM Networks for Sequential Regression with Missing Data | May 22, 2020 | Mixture-of-Experts, regression | Unverified | 0 |
| A Mixture of h-1 Heads is Better than h Heads | May 13, 2020 | Language Modeling, Language Modelling | Unverified | 0 |
| Machine learning based digital twin for dynamical systems with multiple time-scales | May 12, 2020 | BIG-bench Machine Learning, Mixture-of-Experts | Unverified | 0 |
| Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts | Feb 29, 2020 | Mixture-of-Experts, OpenAI Gym | Unverified | 0 |
| Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts | Feb 10, 2020 | Language Modelling, Mixture-of-Experts | Code Available | 1 |
| Learning CHARME models with neural networks | Feb 8, 2020 | Learning Theory, Mixture-of-Experts | Code Available | 0 |
| Off-policy Maximum Entropy Reinforcement Learning : Soft Actor-Critic with Advantage Weighted Mixture Policy(SAC-AWMP) | Feb 7, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Neural Data Server: A Large-Scale Search Engine for Transfer Learning Data | Jan 9, 2020 | image-classification, Image Classification | Unverified | 0 |