Towards Robust Multimodal Representation: A Unified Approach with Adaptive Experts and Alignment

2025-03-12 · Code Available

Nazanin Moradinasab, Saurav Sengupta, Jiebei Liu, Sana Syed, Donald E. Brown


Code

github.com/nazaninmn/mosare
Official · In paper · PyTorch · ★ 5

Abstract

Healthcare relies on multiple types of data, such as medical images, genetic information, and clinical records, to improve diagnosis and treatment. However, missing data is a common challenge due to privacy restrictions, cost, and technical issues, making many existing multimodal models unreliable. To address this, we propose a new multimodal model called Mixture of Experts, Symmetric Aligning, and Reconstruction (MoSARe), a deep learning framework that handles incomplete multimodal data while maintaining high accuracy. MoSARe integrates expert selection, cross-modal attention, and contrastive learning to improve feature representation and decision-making. Our results show that MoSARe outperforms existing models when the data are complete. Furthermore, it provides reliable predictions even when some data are missing. This makes it especially useful in real-world healthcare settings, including resource-limited environments. Our code is publicly available at https://github.com/NazaninMn/MoSARe.
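The abstract does not say how expert selection copes with an absent modality. As a rough illustration only, and not the MoSARe implementation, the sketch below shows one common pattern: a softmax gate restricted to the modalities that are actually present, followed by a weighted fusion of their feature vectors. The function names `masked_gate` and `fuse`, the gating scores, and the toy feature vectors are all hypothetical and do not come from the paper or its repository.

```python
import math


def masked_gate(scores, available):
    """Softmax over gating scores, restricted to available modalities.

    Hypothetical helper for illustration; not taken from the MoSARe code.
    """
    if not any(available):
        raise ValueError("at least one modality must be available")
    # Subtract the largest available score for numerical stability.
    peak = max(s for s, a in zip(scores, available) if a)
    exps = [math.exp(s - peak) if a else 0.0 for s, a in zip(scores, available)]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(features, weights):
    """Weighted sum of per-modality feature vectors; missing ones are None."""
    dim = len(next(f for f in features if f is not None))
    fused = [0.0] * dim
    for feat, w in zip(features, weights):
        if feat is None:
            continue  # a missing modality contributes nothing
        for i, value in enumerate(feat):
            fused[i] += w * value
    return fused


# Assumed toy input: first and third modalities present, second one missing.
features = [[1.0, 0.0], None, [0.0, 1.0]]
available = [f is not None for f in features]
weights = masked_gate([0.5, 2.0, 0.5], available)
fused = fuse(features, weights)
```

In this toy case the missing modality receives zero weight even though its raw gating score is the largest, and the remaining weight is split between the modalities that are present. A learned model would produce the scores and features with neural networks; only the masking idea is shown here.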

Tasks

Contrastive Learning, Decision Making, Mixture-of-Experts
