SOTAVerified

Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts

2023-09-18Code Available1· sign in to hype

Jiang-Xin Shi, Tong Wei, Zhi Zhou, Jie-Jing Shao, Xin-Yan Han, Yu-Feng Li

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

The fine-tuning paradigm in addressing long-tail learning tasks has sparked significant interest since the emergence of foundation models. Nonetheless, how fine-tuning impacts performance in long-tail learning was not explicitly quantified. In this paper, we disclose that heavy fine-tuning may even lead to non-negligible performance deterioration on tail classes, and lightweight fine-tuning is more effective. The reason is attributed to inconsistent class conditions caused by heavy fine-tuning. With the observation above, we develop a low-complexity and accurate long-tail learning algorithms LIFT with the goal of facilitating fast prediction and compact models by adaptive lightweight fine-tuning. Experiments clearly verify that both the training time and the learned parameters are significantly reduced with more accurate predictive performance compared with state-of-the-art approaches. The implementation code is available at https://github.com/shijxcs/LIFT.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CIFAR-100-LT (ρ=10)LIFT (ViT-B/16, ImageNet-21K pre-training)Error Rate8.7Unverified
CIFAR-100-LT (ρ=10)LIFT (ViT-B/16, CLIP)Error Rate15.1Unverified
CIFAR-100-LT (ρ=100)LIFT (ViT-B/16, CLIP)Error Rate18.3Unverified
CIFAR-100-LT (ρ=100)LIFT (ViT-B/16, ImageNet-21K pre-training)Error Rate10.9Unverified
CIFAR-100-LT (ρ=50)LIFT (ViT-B/16, CLIP)Error Rate16.9Unverified
CIFAR-100-LT (ρ=50)LIFT (ViT-B/16, ImageNet-21K pre-training)Error Rate9.8Unverified
ImageNet-LTLIFT (ViT-L/14)Top-1 Accuracy82.9Unverified
ImageNet-LTLIFT (ViT-B/16)Top-1 Accuracy78.3Unverified
iNaturalist 2018LIFT (ViT-L/14)Top-1 Accuracy85.2Unverified
iNaturalist 2018LIFT (ViT-L/14@336px)Top-1 Accuracy87.4Unverified
iNaturalist 2018LIFT (ViT-B/16)Top-1 Accuracy80.4Unverified
Places-LTLIFT (ViT-L/14)Top-1 Accuracy53.7Unverified
Places-LTLIFT (ViT-B/16)Top-1 Accuracy52.2Unverified

Reproductions