| LOLA -- An Open-Source Massively Multilingual Large Language Model | Sep 17, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| LPT++: Efficient Training on Mixture of Long-tailed Experts | Sep 17, 2024 | Mixture-of-Expertsparameter-efficient fine-tuning | —Unverified | 0 |
| Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Sep 16, 2024 | DenoisingMixture-of-Experts | —Unverified | 0 |
| Integrating AI's Carbon Footprint into Risk Management Frameworks: Strategies and Tools for Sustainable Compliance in Banking Sector | Sep 15, 2024 | Cloud ComputingManagement | —Unverified | 0 |
| MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving | Sep 11, 2024 | Autonomous DrivingFeature Engineering | CodeCode Available | 2 |
| DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models | Sep 10, 2024 | Mixture-of-Experts | —Unverified | 0 |
| STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning | Sep 10, 2024 | GSM8KMixture-of-Experts | —Unverified | 0 |
| VE: Modeling Multivariate Time Series Correlation with Variate Embedding | Sep 10, 2024 | Mixture-of-ExpertsMultivariate Time Series Forecasting | CodeCode Available | 0 |
| M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework | Sep 9, 2024 | Computational EfficiencyCross-Modal Retrieval | CodeCode Available | 1 |
| Adapted-MoE: Mixture of Experts with Test-Time Adaption for Anomaly Detection | Sep 9, 2024 | Anomaly DetectionMixture-of-Experts | —Unverified | 0 |