| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| Automatic Document Sketching: Generating Drafts from Analogous Texts | Jun 14, 2021 | Mixture-of-ExpertsReinforcement Learning (RL) | —Unverified | 0 |
| Scaling Vision with Sparse Mixture of Experts | Jun 10, 2021 | Few-Shot Image ClassificationImage Classification | CodeCode Available | 1 |
| DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning | Jun 7, 2021 | Mixture-of-ExpertsMulti-Task Learning | CodeCode Available | 0 |
| AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding | Jun 4, 2021 | AttributeAttribute Extraction | —Unverified | 0 |
| GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input | Jun 1, 2021 | Mixture-of-Expertsnamed-entity-recognition | —Unverified | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | May 31, 2021 | Mixture-of-ExpertsPlaying the Game of 2048 | —Unverified | 0 |
| Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection | May 25, 2021 | Data AugmentationDecoder | —Unverified | 0 |
| Mixture of ELM based experts with trainable gating network | May 25, 2021 | Ensemble LearningMixture-of-Experts | —Unverified | 0 |
| Generalizable Person Re-identification with Relevance-aware Mixture of Experts | May 19, 2021 | Generalizable Person Re-identificationMixture-of-Experts | —Unverified | 0 |
| RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | May 14, 2021 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 |
| MTNet: A Multi-Task Neural Network for On-Field Calibration of Low-Cost Air Monitoring Sensors | May 10, 2021 | feature selectionMixture-of-Experts | —Unverified | 0 |
| KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation | May 10, 2021 | Knowledge DistillationMixture-of-Experts | —Unverified | 0 |
| SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts | May 7, 2021 | DiversityMixture-of-Experts | CodeCode Available | 1 |
| MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering | May 5, 2021 | ClusteringContrastive Learning | CodeCode Available | 1 |
| Robust Federated Learning by Mixture of Experts | Apr 23, 2021 | Federated LearningMixture-of-Experts | CodeCode Available | 0 |
| Probabilistic Rainfall Estimation from Automotive Lidar | Apr 23, 2021 | Mixture-of-Experts | CodeCode Available | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| Non-asymptotic model selection in block-diagonal mixture of polynomial experts models | Apr 18, 2021 | Mixture-of-ExpertsModel Selection | —Unverified | 0 |
| Cross-Domain Label-Adaptive Stance Detection | Apr 15, 2021 | Domain AdaptationMixture-of-Experts | CodeCode Available | 1 |
| A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models | Apr 6, 2021 | Mixture-of-ExpertsModel Selection | CodeCode Available | 0 |
| Multi-GAT: A Graphical Attention-based Hierarchical Multimodal Representation Learning Approach for Human Activity Recognition | Apr 1, 2021 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 |
| Cross-Topic Rumor Detection using Topic-Mixtures | Apr 1, 2021 | Mixture-of-Experts | —Unverified | 0 |
| Imitation Learning from MPC for Quadrupedal Multi-Gait Control | Mar 26, 2021 | Imitation LearningMixture-of-Experts | —Unverified | 0 |
| VDSM: Unsupervised Video Disentanglement with State-Space Modeling and Deep Mixtures of Experts | Mar 12, 2021 | DecoderDisentanglement | CodeCode Available | 1 |
| Real-time Relevant Recommendation Suggestion | Mar 8, 2021 | Mixture-of-ExpertsRecommendation Systems | CodeCode Available | 1 |
| An Autonomous Negotiating Agent Framework with Reinforcement Learning Based Strategies and Adaptive Strategy Switching Mechanism | Feb 6, 2021 | Mixture-of-Experts | —Unverified | 0 |
| Multimodal Variational Autoencoders for Semi-Supervised Learning: In Defense of Product-of-Experts | Jan 18, 2021 | AllMixture-of-Experts | CodeCode Available | 1 |
| A Novel Cluster Classify Regress Model Predictive Controller Formulation; CCR-MPC | Jan 15, 2021 | BIG-bench Machine LearningClustering | —Unverified | 0 |
| Preferential Mixture-of-Experts: Interpretable Models that Rely on Human Expertise as much as Possible | Jan 13, 2021 | Decision MakingManagement | —Unverified | 0 |
| Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity | Jan 11, 2021 | Language ModellingMixture-of-Experts | CodeCode Available | 2 |
| Federated learning using mixture of experts | Jan 1, 2021 | Federated LearningMixture-of-Experts | —Unverified | 0 |
| Exploring Routing Strategies for Multilingual Mixture-of-Experts Models | Jan 1, 2021 | DecoderMixture-of-Experts | —Unverified | 0 |
| Gated Ensemble of Spatio-temporal Mixture of Experts for Multi-task Learning in Ride-hailing System | Dec 31, 2020 | Mixture-of-ExpertsMulti-Task Learning | —Unverified | 0 |
| PFL-MoE: Personalized Federated Learning Based on Mixture of Experts | Dec 31, 2020 | Decision MakingFederated Learning | CodeCode Available | 1 |
| Self-Supervised Multimodal Domino: in Search of Biomarkers for Alzheimer's Disease | Dec 25, 2020 | Contrastive LearningDecoder | CodeCode Available | 0 |
| Channel Gain Cartography via Mixture of Experts | Dec 8, 2020 | Mixture-of-Experts | —Unverified | 0 |
| A similarity-based Bayesian mixture-of-experts model | Dec 3, 2020 | Mixture-of-Expertsmodel | —Unverified | 0 |
| A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings | Dec 1, 2020 | Entity EmbeddingsMixture-of-Experts | CodeCode Available | 0 |
| Neural Transduction for Multilingual Lexical Translation | Dec 1, 2020 | Mixture-of-ExpertsTranslation | —Unverified | 0 |
| Multi-view Depth Estimation using Epipolar Spatio-Temporal Networks | Nov 26, 2020 | Depth EstimationMixture-of-Experts | CodeCode Available | 1 |
| DADNN: Multi-Scene CTR Prediction via Domain-Aware Deep Neural Network | Nov 24, 2020 | Click-Through Rate PredictionMixture-of-Experts | —Unverified | 0 |
| Modular Action Concept Grounding in Semantic Video Prediction | Nov 23, 2020 | Action RecognitionMixture-of-Experts | —Unverified | 0 |
| Nested Mixture of Experts: Cooperative and Competitive Learning of Hybrid Dynamical System | Nov 20, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| RTM Ensemble Learning Results at Quality Estimation Task | Nov 1, 2020 | Ensemble LearningMixture-of-Experts | —Unverified | 0 |
| An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference | Oct 8, 2020 | Data AugmentationMixture-of-Experts | CodeCode Available | 0 |
| Specialized federated learning using a mixture of experts | Oct 5, 2020 | Federated LearningMixture-of-Experts | CodeCode Available | 1 |
| Memory Clustering using Persistent Homology for Multimodality- and Discontinuity-Sensitive Learning of Optimal Control Warm-starts | Oct 2, 2020 | ClusteringMixture-of-Experts | —Unverified | 0 |
| Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network | Sep 30, 2020 | Mixture-of-ExpertsMulti-Task Learning | CodeCode Available | 0 |
| Non-asymptotic oracle inequalities for the Lasso in high-dimensional mixture of experts | Sep 22, 2020 | feature selectionMixture-of-Experts | —Unverified | 0 |