Billion-scale semi-supervised learning for image classification
I. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, Dhruv Mahajan
Code Available — Be the first to reproduce this paper.
ReproduceCode
Abstract
This paper presents a study of semi-supervised learning with large convolutional networks. We propose a pipeline, based on a teacher/student paradigm, that leverages a large collection of unlabelled images (up to 1 billion). Our main goal is to improve the performance for a given target architecture, like ResNet-50 or ResNext. We provide an extensive analysis of the success factors of our approach, which leads us to formulate some recommendations to produce high-accuracy models for image classification with semi-supervised learning. As a result, our approach brings important gains to standard architectures for image, video and fine-grained classification. For instance, by leveraging one billion unlabelled images, our learned vanilla ResNet-50 achieves 81.2% top-1 accuracy on the ImageNet benchmark.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| ImageNet | ResNeXt-101 32x16d (semi-weakly sup.) | Top 1 Accuracy | 84.8 | — | Unverified |
| ImageNet | ResNeXt-101 32x8d (semi-weakly sup.) | Top 1 Accuracy | 84.3 | — | Unverified |
| ImageNet | ResNeXt-101 32x4d (semi-weakly sup.) | Top 1 Accuracy | 83.4 | — | Unverified |
| OmniBenchmark | IG-1B | Average Top-1 Accuracy | 40.4 | — | Unverified |