
Federated Continual Instruction Tuning

2025-03-17

Haiyang Guo, Fanhu Zeng, Fei Zhu, Wenzhuo Liu, Da-Han Wang, Jian Xu, Xu-Yao Zhang, Cheng-Lin Liu


Abstract

A vast amount of instruction-tuning data is crucial to the impressive performance of Large Multimodal Models (LMMs), but the computational costs and data-collection demands of supervised fine-tuning put it out of reach for most researchers. Federated learning (FL) has the potential to leverage all of the distributed data and training resources, reducing the overhead of joint training. However, most existing methods assume a fixed number of tasks, whereas in real-world scenarios clients continuously encounter new knowledge and often struggle to retain old tasks due to memory constraints. In this work, we introduce the Federated Continual Instruction Tuning (FCIT) benchmark to model this real-world challenge. Our benchmark covers two realistic scenarios, spanning four settings and twelve carefully curated instruction-tuning datasets. To address the challenges posed by FCIT, we propose dynamic knowledge organization, which integrates updates from different tasks during training, and subspace selective activation, which allocates task-specific outputs during inference. Extensive experimental results demonstrate that our method significantly enhances model performance across varying levels of data heterogeneity and mitigates catastrophic forgetting. Our source code and dataset will be made publicly available.
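
The abstract describes the two mechanisms only at a high level, so below is a minimal sketch of how they could fit together, assuming a LoRA-style low-rank adapter per task: task-specific subspaces are kept alongside a frozen backbone during training (knowledge organization), and at inference the subspace whose stored input key best matches the incoming features is activated (subspace selective activation). All names here (`LowRankSubspace`, `SubspacePool`, the `key` buffer) are hypothetical illustrations, not the authors' implementation.

```python
# Hypothetical sketch of per-task subspaces with selective activation;
# not the paper's released code.
import torch
import torch.nn as nn


class LowRankSubspace(nn.Module):
    """One task-specific low-rank update (a LoRA-style adapter B @ A)."""

    def __init__(self, dim, rank=4):
        super().__init__()
        self.A = nn.Parameter(torch.randn(rank, dim) * 0.01)
        self.B = nn.Parameter(torch.zeros(dim, rank))
        # Mean of the task's inputs, stored as a key for selective activation.
        self.register_buffer("key", torch.zeros(dim))

    def forward(self, x):
        return x @ self.A.t() @ self.B.t()


class SubspacePool(nn.Module):
    """A frozen backbone layer plus a growing pool of task subspaces."""

    def __init__(self, dim):
        super().__init__()
        self.dim = dim
        self.base = nn.Linear(dim, dim)
        for p in self.base.parameters():  # backbone stays frozen
            p.requires_grad_(False)
        self.subspaces = nn.ModuleList()

    def add_task(self, rank=4):
        """Open a fresh subspace when a new task arrives (training time)."""
        sub = LowRankSubspace(self.dim, rank)
        self.subspaces.append(sub)
        return sub

    def forward(self, x, task_id=None):
        out = self.base(x)
        if task_id is None:
            # Inference: activate the subspace whose key is most similar
            # to the batch's mean input feature.
            keys = torch.stack([s.key for s in self.subspaces])
            sims = torch.cosine_similarity(x.mean(0, keepdim=True), keys)
            task_id = int(sims.argmax())
        return out + self.subspaces[task_id](x)


pool = SubspacePool(dim=64)
for t in range(3):                      # three tasks arriving sequentially
    sub = pool.add_task()
    x = torch.randn(32, 64)             # stand-in for instruction features
    sub.key.copy_(x.mean(0))            # record this task's input statistics
    _ = pool(x, task_id=t)              # train with the known task id
print(pool(torch.randn(32, 64)).shape)  # inference: task id inferred from input
```

In a federated round under this sketch, each client would train only the current task's subspace and send it to the server for aggregation; keeping subspaces disjoint is what would let old-task knowledge survive without a replay buffer.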
