Sample Prior Guided Robust Model Learning to Suppress Noisy Labels

2021-12-02Code Available1· sign in to hype

Wenkai Chen, Chuang Zhu, Yi Chen, Mengting Li, Tiejun Huang

Code Available — Be the first to reproduce this paper.

Code

github.com/bupt-ai-cz/PGDF
Officialpytorch★ 31

Abstract

Imperfect labels are ubiquitous in real-world datasets and seriously harm the model performance. Several recent effective methods for handling noisy labels have two key steps: 1) dividing samples into cleanly labeled and wrongly labeled sets by training loss, 2) using semi-supervised methods to generate pseudo-labels for samples in the wrongly labeled set. However, current methods always hurt the informative hard samples due to the similar loss distribution between the hard samples and the noisy ones. In this paper, we proposed PGDF (Prior Guided Denoising Framework), a novel framework to learn a deep model to suppress noise by generating the samples' prior knowledge, which is integrated into both dividing samples step and semi-supervised step. Our framework can save more informative hard clean samples into the cleanly labeled set. Besides, our framework also promotes the quality of pseudo-labels during the semi-supervised step by suppressing the noise in the current pseudo-labels generating scheme. To further enhance the hard samples, we reweight the samples in the cleanly labeled set during training. We evaluated our method using synthetic datasets based on CIFAR-10 and CIFAR-100, as well as on the real-world datasets WebVision and Clothing1M. The results demonstrate substantial improvements over state-of-the-art methods.

Tasks

Image Classification Learning with noisy labels

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CIFAR-10 (with noisy labels)	PGDF (ResNet-18)	Accuracy (under 20% Sym. label noise)	96.7	—	Unverified
Clothing1M	PGDF	Accuracy	75.19	—	Unverified
mini WebVision 1.0	PGDF (Inception-ResNet-v2)	Top-1 Accuracy	81.47	—	Unverified

Sample Prior Guided Robust Model Learning to Suppress Noisy Labels

Code

Abstract

Tasks

Benchmark Results

Reproductions