Big Transfer (BiT): General Visual Representation Learning

2019-12-24ECCV 2020Code Available2· sign in to hype

Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Joan Puigcerver, Jessica Yung, Sylvain Gelly, Neil Houlsby

Code Available — Be the first to reproduce this paper.

Code

github.com/google-research/big_transfer
Officialjax★ 1,538
github.com/sayakpaul/FunMatch-Distillation
tf★ 88
github.com/bethgelab/InDomainGeneralizationBenchmark
pytorch★ 34
github.com/batsresearch/taglets
pytorch★ 17
github.com/2024-MindSpore-1/Code2/tree/main/model-1/bit
mindspore★ 0
github.com/SoojungYang/supervised_pretraining_GN_WS
tf★ 0
github.com/MS-Mind/MS-Code-02/tree/main/configs/bit
mindspore★ 0
github.com/hw666666666666/BigTransfer
mindspore★ 0
github.com/sayakpaul/A-Barebones-Image-Retrieval-System
tf★ 0

Abstract

Transfer of pre-trained representations improves sample efficiency and simplifies hyperparameter tuning when training deep neural networks for vision. We revisit the paradigm of pre-training on large supervised datasets and fine-tuning the model on a target task. We scale up pre-training, and propose a simple recipe that we call Big Transfer (BiT). By combining a few carefully selected components, and transferring using a simple heuristic, we achieve strong performance on over 20 datasets. BiT performs well across a surprisingly wide range of data regimes -- from 1 example per class to 1M total examples. BiT achieves 87.5% top-1 accuracy on ILSVRC-2012, 99.4% on CIFAR-10, and 76.3% on the 19 task Visual Task Adaptation Benchmark (VTAB). On small datasets, BiT attains 76.8% on ILSVRC-2012 with 10 examples per class, and 97.0% on CIFAR-10 with 10 examples per class. We conduct detailed analysis of the main components that lead to high transfer performance.

Tasks

Few-Shot Learning Fine-Grained Image Classification Image Classification Out-of-Distribution Generalization Representation Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Oxford 102 Flowers	BiT-M (ResNet)	Accuracy	99.3	—	Unverified
Oxford 102 Flowers	BiT-L (ResNet)	Accuracy	99.63	—	Unverified
Oxford-IIIT Pets	BiT-L (ResNet)	Accuracy	96.62	—	Unverified
Oxford-IIIT Pets	BiT-M (ResNet)	Accuracy	94.47	—	Unverified

Big Transfer (BiT): General Visual Representation Learning

Code

Abstract

Tasks

Benchmark Results

Reproductions