Information Maximization Clustering via Multi-View Self-Labelling

2021-03-12Code Available1· sign in to hype

Foivos Ntelemis, Yaochu Jin, Spencer A. Thomas

Code Available — Be the first to reproduce this paper.

Code

github.com/foiv0s/imc-swav-pub
OfficialIn paperpytorch★ 13

Abstract

Image clustering is a particularly challenging computer vision task, which aims to generate annotations without human supervision. Recent advances focus on the use of self-supervised learning strategies in image clustering, by first learning valuable semantics and then clustering the image representations. These multiple-phase algorithms, however, increase the computational time and their final performance is reliant on the first stage. By extending the self-supervised approach, we propose a novel single-phase clustering method that simultaneously learns meaningful representations and assigns the corresponding annotations. This is achieved by integrating a discrete representation into the self-supervised paradigm through a classifier net. Specifically, the proposed clustering objective employs mutual information, and maximizes the dependency between the integrated discrete representation and a discrete probability distribution. The discrete probability distribution is derived though the self-supervised process by comparing the learnt latent representation with a set of trainable prototypes. To enhance the learning performance of the classifier, we jointly apply the mutual information across multi-crop views. Our empirical results show that the proposed framework outperforms state-of-the-art techniques with the average accuracy of 89.1% and 49.0%, respectively, on CIFAR-10 and CIFAR-100/20 datasets. Finally, the proposed method also demonstrates attractive robustness to parameter settings, making it ready to be applicable to other datasets.

Tasks

Clustering Image Classification Image Clustering Self-Supervised Learning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CIFAR-10	IMC-SwAV (Best)	Accuracy	0.9	—	Unverified
CIFAR-10	IMC-SwAV (Avg+-)	Accuracy	0.89	—	Unverified
CIFAR-100	IMC-SwAV (Best)	Accuracy	0.52	—	Unverified
CIFAR-100	IMC-SwAV (Avg+-)	Accuracy	0.49	—	Unverified
STL-10	IMC-SwAV (Avg+-)	Accuracy	0.83	—	Unverified
STL-10	IMC-SwAV (Best)	Accuracy	0.85	—	Unverified
Tiny ImageNet	IMC-SwAV (Best)	Accuracy	0.28	—	Unverified
Tiny ImageNet	IMC-SwAV (Avg+-)	Accuracy	0.28	—	Unverified

Information Maximization Clustering via Multi-View Self-Labelling

Code

Abstract

Tasks

Benchmark Results

Reproductions