SimMatch: Semi-supervised Learning with Similarity Matching

2022-03-14CVPR 2022Code Available0· sign in to hype

Mingkai Zheng, Shan You, Lang Huang, Fei Wang, Chen Qian, Chang Xu

Code Available — Be the first to reproduce this paper.

Code

github.com/kylezheng1997/simmatch
OfficialIn paperpytorch★ 0

Abstract

Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised learning framework, SimMatch, which simultaneously considers semantic similarity and instance similarity. In SimMatch, the consistency regularization will be applied on both semantic-level and instance-level. The different augmented views of the same instance are encouraged to have the same class prediction and similar similarity relationship respected to other instances. Next, we instantiated a labeled memory buffer to fully leverage the ground truth labels on instance-level and bridge the gaps between the semantic and instance similarities. Finally, we proposed the unfolding and aggregation operation which allows these two similarities be isomorphically transformed with each other. In this way, the semantic and instance pseudo-labels can be mutually propagated to generate more high-quality and reliable matching targets. Extensive experimental results demonstrate that SimMatch improves the performance of semi-supervised learning tasks across different benchmark datasets and different settings. Notably, with 400 epochs of training, SimMatch achieves 67.2\%, and 74.4\% Top-1 Accuracy with 1\% and 10\% labeled examples on ImageNet, which significantly outperforms the baseline methods and is better than previous semi-supervised learning frameworks. Code and pre-trained models are available at https://github.com/KyleZheng1997/simmatch.

Tasks

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
cifar-100, 10000 Labels	SimMatch	Percentage error	20.58	—	Unverified
CIFAR-100, 2500 Labels	SimMatch	Percentage error	25.07	—	Unverified
CIFAR-100, 400 Labels	SimMatch	Percentage error	37.81	—	Unverified
CIFAR-10, 250 Labels	SimMatch	Percentage error	4.84	—	Unverified
CIFAR-10, 4000 Labels	SimMatch	Percentage error	3.96	—	Unverified
CIFAR-10, 40 Labels	SimMatch	Percentage error	5.6	—	Unverified
ImageNet - 10% labeled data	SimMatch (ResNet-50)	Top 1 Accuracy	74.4	—	Unverified
ImageNet - 1% labeled data	SimMatch (ResNet-50)	Top 1 Accuracy	67.2	—	Unverified

SimMatch: Semi-supervised Learning with Similarity Matching

Code

Abstract

Tasks

Benchmark Results

Reproductions