Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data

2023-06-02Code Available0· sign in to hype

Shuvendu Roy, Ali Etemad

Code Available — Be the first to reproduce this paper.

Code

github.com/shuvenduroy/unmixmatch
OfficialIn paperpytorch★ 4

Abstract

We propose UnMixMatch, a semi-supervised learning framework which can learn effective representations from unconstrained unlabelled data in order to scale up performance. Most existing semi-supervised methods rely on the assumption that labelled and unlabelled samples are drawn from the same distribution, which limits the potential for improvement through the use of free-living unlabeled data. Consequently, the generalizability and scalability of semi-supervised learning are often hindered by this assumption. Our method aims to overcome these constraints and effectively utilize unconstrained unlabelled data in semi-supervised learning. UnMixMatch consists of three main components: a supervised learner with hard augmentations that provides strong regularization, a contrastive consistency regularizer to learn underlying representations from the unlabelled data, and a self-supervised loss to enhance the representations that are learnt from the unlabelled data. We perform extensive experiments on 4 commonly used datasets and demonstrate superior performance over existing semi-supervised methods with a performance boost of 4.79%. Extensive ablation and sensitivity studies show the effectiveness and impact of each of the proposed components of our method.

Tasks

Image Classification Network Pruning Semi-Supervised Image Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CIFAR-10 (40 Labels, ImageNet-100 Unlabeled)	UnMixMatch	Accuarcy	52.07	—	Unverified

Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data

Code

Abstract

Tasks

Benchmark Results

Reproductions