SOTAVerified

Improving Generalization in Federated Learning by Seeking Flat Minima

2022-03-22Code Available1· sign in to hype

Debora Caldarola, Barbara Caputo, Marco Ciccone

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Models trained in federated settings often suffer from degraded performances and fail at generalizing, especially when facing heterogeneous scenarios. In this work, we investigate such behavior through the lens of geometry of the loss and Hessian eigenspectrum, linking the model's lack of generalization capacity to the sharpness of the solution. Motivated by prior studies connecting the sharpness of the loss surface and the generalization gap, we show that i) training clients locally with Sharpness-Aware Minimization (SAM) or its adaptive version (ASAM) and ii) averaging stochastic weights (SWA) on the server-side can substantially improve generalization in Federated Learning and help bridging the gap with centralized models. By seeking parameters in neighborhoods having uniform low loss, the model converges towards flatter minima and its generalization significantly improves in both homogeneous and heterogeneous scenarios. Empirical results demonstrate the effectiveness of those optimizers across a variety of benchmark vision datasets (e.g. CIFAR10/100, Landmarks-User-160k, IDDA) and tasks (large scale classification, semantic segmentation, domain generalization).

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CIFAR-100 (alpha=0, 10 clients per round)FedASAM + SWAACC@1-100Clients42.64Unverified
CIFAR-100 (alpha=0, 10 clients per round)FedAvgACC@1-100Clients36.74Unverified
CIFAR-100 (alpha=0, 10 clients per round)FedSAMACC@1-100Clients36.93Unverified
CIFAR-100 (alpha=0, 10 clients per round)FedSAM + SWAACC@1-100Clients39.51Unverified
CIFAR-100 (alpha=0, 10 clients per round)FedASAMACC@1-100Clients39.76Unverified
CIFAR-100 (alpha=0, 20 clients per round)FedAvgACC@1-100Clients38.59Unverified
CIFAR-100 (alpha=0, 20 clients per round)FedSAMACC@1-100Clients38.56Unverified
CIFAR-100 (alpha=0, 20 clients per round)FedASAM + SWAACC@1-100Clients41.62Unverified
CIFAR-100 (alpha=0, 20 clients per round)FedASAMACC@1-100Clients40.81Unverified
CIFAR-100 (alpha=0, 20 clients per round)FedSAM + SWAACC@1-100Clients39.24Unverified
CIFAR-100 (alpha=0.5, 10 clients per round)FedSAM + SWAACC@1-100Clients46.76Unverified
CIFAR-100 (alpha=0.5, 10 clients per round)FedASAMACC@1-100Clients46.58Unverified
CIFAR-100 (alpha=0.5, 10 clients per round)FedAvgACC@1-100Clients41.27Unverified
CIFAR-100 (alpha=0.5, 10 clients per round)FedSAMACC@1-100Clients44.84Unverified
CIFAR-100 (alpha=0.5, 10 clients per round)FedASAM + SWAACC@1-100Clients48.72Unverified
CIFAR-100 (alpha=0.5, 20 clients per round)FedAvgACC@1-100Clients42.17Unverified
CIFAR-100 (alpha=0.5, 20 clients per round)FedSAM + SWAACC@1-100Clients46.47Unverified
CIFAR-100 (alpha=0.5, 20 clients per round)FedASAMACC@1-100Clients47.78Unverified
CIFAR-100 (alpha=0.5, 20 clients per round)FedASAM + SWAACC@1-100Clients48.27Unverified
CIFAR-100 (alpha=0.5, 20 clients per round)FedSAMACC@1-100Clients46.05Unverified
CIFAR-100 (alpha=0.5, 5 clients per round)FedSAMACC@1-100Clients44.73Unverified
CIFAR-100 (alpha=0.5, 5 clients per round)FedASAMACC@1-100Clients45.61Unverified
CIFAR-100 (alpha=0.5, 5 clients per round)FedASAM + SWAACC@1-100Clients49.17Unverified
CIFAR-100 (alpha=0.5, 5 clients per round)FedSAM + SWAACC@1-100Clients47.96Unverified
CIFAR-100 (alpha=0.5, 5 clients per round)FedAvgACC@1-100Clients40.43Unverified
CIFAR-100 (alpha=0, 5 clients per round)FedASAMACC@1-100Clients36.04Unverified
CIFAR-100 (alpha=0, 5 clients per round)FedAvgACC@1-100Clients30.25Unverified
CIFAR-100 (alpha=0, 5 clients per round)FedSAM + SWAACC@1-100Clients39.3Unverified
CIFAR-100 (alpha=0, 5 clients per round)FedASAM + SWAACC@1-100Clients42.01Unverified
CIFAR-100 (alpha=0, 5 clients per round)FedSAMACC@1-100Clients31.04Unverified
CIFAR-100 (alpha=1000, 10 clients per round)FedSAMACC@1-100Clients53.39Unverified
CIFAR-100 (alpha=1000, 10 clients per round)FedSAM + SWAACC@1-100Clients53.67Unverified
CIFAR-100 (alpha=1000, 10 clients per round)FedASAM + SWAACC@1-100Clients54.79Unverified
CIFAR-100 (alpha=1000, 10 clients per round)FedASAMACC@1-100Clients54.97Unverified
CIFAR-100 (alpha=1000, 10 clients per round)FedAvgACC@1-100Clients50.25Unverified
CIFAR-100 (alpha=1000, 20 clients per round)FedAvgACC@1-100Clients50.66Unverified
CIFAR-100 (alpha=1000, 20 clients per round)FedASAMACC@1-100Clients54.5Unverified
CIFAR-100 (alpha=1000, 20 clients per round)FedSAM + SWAACC@1-100Clients54.36Unverified
CIFAR-100 (alpha=1000, 20 clients per round)FedASAM + SWAACC@1-100Clients54.1Unverified
CIFAR-100 (alpha=1000, 20 clients per round)FedSAMACC@1-100Clients53.97Unverified
CIFAR-100 (alpha=1000, 5 clients per round)FedSAM + SWAACC@1-100Clients53.9Unverified
CIFAR-100 (alpha=1000, 5 clients per round)FedASAM + SWAACC@1-100Clients53.86Unverified
CIFAR-100 (alpha=1000, 5 clients per round)FedAvgACC@1-100Clients49.92Unverified
CIFAR-100 (alpha=1000, 5 clients per round)FedASAMACC@1-100Clients54.81Unverified
CIFAR-100 (alpha=1000, 5 clients per round)FedSAMACC@1-100Clients54.01Unverified
Cityscapes heterogeneousFedASAMmIoU42.27Unverified
Cityscapes heterogeneousFedAvg + SWAmIoU42.48Unverified
Cityscapes heterogeneousSiloBN + ASAMmIoU49.75Unverified
Cityscapes heterogeneousFedSAM + SWAmIoU43.42Unverified
Cityscapes heterogeneousFedASAM + SWAmIoU43.02Unverified

Reproductions