| Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts | May 18, 2024 | Mixture-of-ExpertsVisual Question Answering | CodeCode Available | 5 |
| Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts | May 16, 2024 | Dialogue State TrackingMixture-of-Experts | —Unverified | 0 |
| M^4oE: A Foundation Model for Medical Multimodal Image Segmentation with Mixture of Experts | May 15, 2024 | Image SegmentationMixture-of-Experts | CodeCode Available | 1 |
| A Mixture of Experts Approach to 3D Human Motion Prediction | May 9, 2024 | Human motion predictionMixture-of-Experts | CodeCode Available | 0 |
| A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds | May 9, 2024 | Few-Shot LearningMixture-of-Experts | —Unverified | 0 |
| EWMoE: An effective model for global weather forecasting with mixture-of-experts | May 9, 2024 | Mixture-of-ExpertsWeather Forecasting | CodeCode Available | 1 |
| CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | May 9, 2024 | Image CaptioningInstruction Following | CodeCode Available | 2 |
| SUTRA: Scalable Multilingual Language Model Architecture | May 7, 2024 | Computational EfficiencyHallucination | —Unverified | 0 |
| DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model | May 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 9 |
| MEET: Mixture of Experts Extra Tree-Based sEMG Hand Gesture Identification | May 6, 2024 | Electromyography (EMG)Gesture Recognition | —Unverified | 0 |