MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels

2017-12-14ICML 2018Code Available0· sign in to hype

Lu Jiang, Zhengyuan Zhou, Thomas Leung, Li-Jia Li, Li Fei-Fei

Code Available — Be the first to reproduce this paper.

Code

github.com/google/mentornet
OfficialIn papertf★ 0

Abstract

Recent deep networks are capable of memorizing the entire data even when the labels are completely random. To overcome the overfitting on corrupted labels, we propose a novel technique of learning another neural network, called MentorNet, to supervise the training of the base deep networks, namely, StudentNet. During training, MentorNet provides a curriculum (sample weighting scheme) for StudentNet to focus on the sample the label of which is probably correct. Unlike the existing curriculum that is usually predefined by human experts, MentorNet learns a data-driven curriculum dynamically with StudentNet. Experimental results demonstrate that our approach can significantly improve the generalization performance of deep networks trained on corrupted training data. Notably, to the best of our knowledge, we achieve the best-published result on WebVision, a large benchmark containing 2.2 million images of real-world noisy labels. The code are at https://github.com/google/mentornet

Tasks

Image Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
mini WebVision 1.0	MentorNet (Inception-ResNet-v2)	ImageNet Top-1 Accuracy	63.8	—	Unverified
WebVision-1000	MentorNet (InceptionResNet-V2)	Top-1 Accuracy	70.8	—	Unverified

MentorNet: Learning Data-Driven Curriculum for Very Deep Neural Networks on Corrupted Labels

Code

Abstract

Tasks

Benchmark Results

Reproductions