Repetitive Reprediction Deep Decipher for Semi-Supervised Learning
Guo-Hua Wang, Jianxin Wu
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/DoctorKey/R2D2.pytorchOfficialpytorch★ 0
Abstract
Most recent semi-supervised deep learning (deep SSL) methods used a similar paradigm: use network predictions to update pseudo-labels and use pseudo-labels to update network parameters iteratively. However, they lack theoretical support and cannot explain why predictions are good candidates for pseudo-labels. In this paper, we propose a principled end-to-end framework named deep decipher (D2) for SSL. Within the D2 framework, we prove that pseudo-labels are related to network predictions by an exponential link function, which gives a theoretical support for using predictions as pseudo-labels. Furthermore, we demonstrate that updating pseudo-labels by network predictions will make them uncertain. To mitigate this problem, we propose a training strategy called repetitive reprediction (R2). Finally, the proposed R2-D2 method is tested on the large-scale ImageNet dataset and outperforms state-of-the-art methods by 5 percentage points.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| cifar-100, 10000 Labels | R2-D2 (CNN-13) | Percentage error | 32.87 | — | Unverified |
| CIFAR-10, 4000 Labels | R2-D2 (Shake-Shake) | Percentage error | 5.72 | — | Unverified |
| ImageNet - 10% labeled data | R2-D2 (ResNet-18) | Top 5 Accuracy | 90.48 | — | Unverified |
| SVHN, 1000 labels | R2-D2 (CNN-13) | Accuracy | 96.36 | — | Unverified |