| MoExtend: Tuning New Experts for Modality and Task Extension | Aug 7, 2024 | Mixture-of-Experts | CodeCode Available | 1 |
| Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization | Aug 5, 2024 | Face DetectionMixture-of-Experts | —Unverified | 0 |
| HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction | Aug 2, 2024 | Click-Through Rate PredictionMixture-of-Experts | —Unverified | 0 |
| Multimodal Fusion and Coherence Modeling for Video Topic Segmentation | Aug 1, 2024 | Contrastive LearningMixture-of-Experts | —Unverified | 0 |
| PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning | Jul 31, 2024 | Continual LearningGeneral Knowledge | —Unverified | 0 |
| MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts | Jul 31, 2024 | Causal InferenceLanguage Modelling | —Unverified | 0 |
| Distribution Learning for Molecular Regression | Jul 30, 2024 | Mixture-of-ExpertsMolecular Property Prediction | —Unverified | 0 |
| Time series forecasting with high stakes: A field study of the air cargo industry | Jul 29, 2024 | Decision MakingDemand Forecasting | —Unverified | 0 |
| Mixture of Nested Experts: Adaptive Processing of Visual Tokens | Jul 29, 2024 | Mixture-of-Experts | CodeCode Available | 0 |
| Mixture of Modular Experts: Distilling Knowledge from a Multilingual Teacher into Specialized Modular Language Models | Jul 28, 2024 | Knowledge DistillationMixture-of-Experts | CodeCode Available | 0 |
| MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition | Jul 26, 2024 | Mixture-of-ExpertsScene Text Recognition | CodeCode Available | 0 |
| Wolf: Captioning Everything with a World Summarization Framework | Jul 26, 2024 | Autonomous DrivingMixture-of-Experts | —Unverified | 0 |
| Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing | Jul 26, 2024 | AttributeLanguage Modelling | CodeCode Available | 1 |
| How Lightweight Can A Vision Transformer Be | Jul 25, 2024 | Mixture-of-ExpertsTransfer Learning | —Unverified | 0 |
| Exploring Domain Robust Lightweight Reward Models based on Router Mechanism | Jul 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Wonderful Matrices: More Efficient and Effective Architecture for Language Modeling Tasks | Jul 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis | Jul 24, 2024 | Mixture-of-ExpertsMultiple Instance Learning | CodeCode Available | 1 |
| Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget | Jul 22, 2024 | Mixture-of-Experts | CodeCode Available | 5 |
| Norface: Improving Facial Expression Analysis by Identity Normalization | Jul 22, 2024 | ClassificationEmotion Recognition | CodeCode Available | 1 |
| EEGMamba: Bidirectional State Space Model with Mixture of Experts for EEG Multi-task Classification | Jul 20, 2024 | EEGElectroencephalogram (EEG) | —Unverified | 0 |
| Mixture of Experts with Mixture of Precisions for Tuning Quality of Service | Jul 19, 2024 | CPUGPU | —Unverified | 0 |
| EVLM: An Efficient Vision-Language Model for Visual Understanding | Jul 19, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Mixture of Experts based Multi-task Supervise Learning from Crowds | Jul 18, 2024 | Mixture-of-Experts | —Unverified | 0 |
| Discussion: Effective and Interpretable Outcome Prediction by Training Sparse Mixtures of Linear Experts | Jul 18, 2024 | feature selectionMixture-of-Experts | —Unverified | 0 |
| MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration | Jul 15, 2024 | Image RestorationMixture-of-Experts | —Unverified | 0 |