MoPro: Webly Supervised Learning with Momentum Prototypes

2020-09-17ICLR 2021Code Available1· sign in to hype

Junnan Li, Caiming Xiong, Steven C. H. Hoi

Code Available — Be the first to reproduce this paper.

Code

github.com/salesforce/MoPro
OfficialIn paperpytorch★ 88
github.com/yuleiqin/capro
pytorch★ 2

Abstract

We propose a webly-supervised representation learning method that does not suffer from the annotation unscalability of supervised learning, nor the computation unscalability of self-supervised learning. Most existing works on webly-supervised representation learning adopt a vanilla supervised learning method without accounting for the prevalent noise in the training data, whereas most prior methods in learning with label noise are less effective for real-world large-scale noisy data. We propose momentum prototypes (MoPro), a simple contrastive learning method that achieves online label noise correction, out-of-distribution sample removal, and representation learning. MoPro achieves state-of-the-art performance on WebVision, a weakly-labeled noisy dataset. MoPro also shows superior performance when the pretrained model is transferred to down-stream image classification and detection tasks. It outperforms the ImageNet supervised pretrained model by +10.5 on 1-shot classification on VOC, and outperforms the best self-supervised pretrained model by +17.3 when finetuned on 1\% of ImageNet labeled samples. Furthermore, MoPro is more robust to distribution shifts. Code and pretrained models are available at https://github.com/salesforce/MoPro.

Tasks

Contrastive Learning image-classification Image Classification Representation Learning Self-Supervised Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
OmniBenchmark	MoPro-V2	Average Top-1 Accuracy	36.1	—	Unverified
WebVision-1000	MoPro (ResNet-50)	Top-1 Accuracy	73.9	—	Unverified

MoPro: Webly Supervised Learning with Momentum Prototypes

Code

Abstract

Tasks

Benchmark Results

Reproductions