| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement Learning, Mixture-of-Experts | Unverified | 0 |
| Automatic Document Sketching: Generating Drafts from Analogous Texts | Jun 14, 2021 | Mixture-of-Experts, Reinforcement Learning (RL) | Unverified | 0 |
| Scaling Vision with Sparse Mixture of Experts | Jun 10, 2021 | Few-Shot Image Classification, Image Classification | Code Available | 1 |
| DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning | Jun 7, 2021 | Mixture-of-Experts, Multi-Task Learning | Code Available | 0 |
| AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding | Jun 4, 2021 | Attribute, Attribute Extraction | Unverified | 0 |
| GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input | Jun 1, 2021 | Mixture-of-Experts, named-entity-recognition | Unverified | 0 |
| M6-T: Exploring Sparse Expert Models and Beyond | May 31, 2021 | Mixture-of-Experts, Playing the Game of 2048 | Unverified | 0 |
| Data Expansion using Back Translation and Paraphrasing for Hate Speech Detection | May 25, 2021 | Data Augmentation, Decoder | Unverified | 0 |
| Mixture of ELM based experts with trainable gating network | May 25, 2021 | Ensemble Learning, Mixture-of-Experts | Unverified | 0 |
| Generalizable Person Re-identification with Relevance-aware Mixture of Experts | May 19, 2021 | Generalizable Person Re-identification, Mixture-of-Experts | Unverified | 0 |
| RetGen: A Joint framework for Retrieval and Grounded Text Generation Modeling | May 14, 2021 | Dialogue Generation, Language Modeling | Code Available | 1 |
| MTNet: A Multi-Task Neural Network for On-Field Calibration of Low-Cost Air Monitoring Sensors | May 10, 2021 | feature selection, Mixture-of-Experts | Unverified | 0 |
| KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation | May 10, 2021 | Knowledge Distillation, Mixture-of-Experts | Unverified | 0 |
| SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts | May 7, 2021 | Diversity, Mixture-of-Experts | Code Available | 1 |
| MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering | May 5, 2021 | Clustering, Contrastive Learning | Code Available | 1 |
| Robust Federated Learning by Mixture of Experts | Apr 23, 2021 | Federated Learning, Mixture-of-Experts | Code Available | 0 |
| Probabilistic Rainfall Estimation from Automotive Lidar | Apr 23, 2021 | Mixture-of-Experts | Code Available | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement Learning, Mixture-of-Experts | Code Available | 0 |
| Non-asymptotic model selection in block-diagonal mixture of polynomial experts models | Apr 18, 2021 | Mixture-of-Experts, Model Selection | Unverified | 0 |
| Cross-Domain Label-Adaptive Stance Detection | Apr 15, 2021 | Domain Adaptation, Mixture-of-Experts | Code Available | 1 |
| A non-asymptotic approach for model selection via penalization in high-dimensional mixture of experts models | Apr 6, 2021 | Mixture-of-Experts, Model Selection | Code Available | 0 |
| Multi-GAT: A Graphical Attention-based Hierarchical Multimodal Representation Learning Approach for Human Activity Recognition | Apr 1, 2021 | Activity Recognition, Human Activity Recognition | Unverified | 0 |
| Cross-Topic Rumor Detection using Topic-Mixtures | Apr 1, 2021 | Mixture-of-Experts | Unverified | 0 |
| Imitation Learning from MPC for Quadrupedal Multi-Gait Control | Mar 26, 2021 | Imitation Learning, Mixture-of-Experts | Unverified | 0 |
| VDSM: Unsupervised Video Disentanglement with State-Space Modeling and Deep Mixtures of Experts | Mar 12, 2021 | Decoder, Disentanglement | Code Available | 1 |