Exploring the Limits of Weakly Supervised Pretraining

2018-05-02ECCV 2018Code Available0· sign in to hype

Dhruv Mahajan, Ross Girshick, Vignesh Ramanathan, Kaiming He, Manohar Paluri, Yixuan Li, Ashwin Bharambe, Laurens van der Maaten

arXiv PDF

Code Available — Be the first to reproduce this paper.

Reproduce

Code

github.com/eminorhan/resnext-wsl
pytorch★ 15
github.com/facebookresearch/WSL-Images
pytorch★ 0

Abstract

State-of-the-art visual perception models for a wide range of tasks rely on supervised pretraining. ImageNet classification is the de facto pretraining task for these models. Yet, ImageNet is now nearly ten years old and is by modern standards "small". Even so, relatively little is known about the behavior of pretraining with datasets that are multiple orders of magnitude larger. The reasons are obvious: such datasets are difficult to collect and annotate. In this paper, we present a unique study of transfer learning with large convolutional networks trained to predict hashtags on billions of social media images. Our experiments demonstrate that training for large-scale hashtag prediction leads to excellent results. We show improvements on several image classification and object detection tasks, and report the highest ImageNet-1k single-crop, top-1 accuracy to date: 85.4% (97.6% top-5). We also perform extensive experiments that provide novel empirical data on the relationship between large-scale pretraining and transfer learning performance.

Tasks

General Classification image-classification Image Classification object-detection Object Detection Transfer Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ImageNet	ResNeXt-101 32x48d	Top 1 Accuracy	85.4	—	Unverified
ImageNet	ResNeXt-101 32x32d	Top 1 Accuracy	85.1	—	Unverified
ImageNet	ResNeXt-101 32×16d	Top 1 Accuracy	84.2	—	Unverified
ImageNet	ResNeXt-101 32x8d	Top 1 Accuracy	82.2	—	Unverified

Exploring the Limits of Weakly Supervised Pretraining

Code

Abstract

Tasks

Benchmark Results

Reproductions