Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

2019-12-05Code Available0· sign in to hype

Guy Shiran, Daphna Weinshall

Code Available — Be the first to reproduce this paper.

Code

github.com/guysrn/mmdc
OfficialIn paperpytorch★ 0

Abstract

The clustering of unlabeled raw images is a daunting task, which has recently been approached with some success by deep learning methods. Here we propose an unsupervised clustering framework, which learns a deep neural network in an end-to-end fashion, providing direct cluster assignments of images without additional processing. Multi-Modal Deep Clustering (MMDC), trains a deep network to align its image embeddings with target points sampled from a Gaussian Mixture Model distribution. The cluster assignments are then determined by mixture component association of image embeddings. Simultaneously, the same deep network is trained to solve an additional self-supervised task of predicting image rotations. This pushes the network to learn more meaningful image representations that facilitate a better clustering. Experimental results show that MMDC achieves or exceeds state-of-the-art performance on six challenging benchmarks. On natural image datasets we improve on previous results with significant margins of up to 20% absolute accuracy points, yielding an accuracy of 82% on CIFAR-10, 45% on CIFAR-100 and 69% on STL-10.

Tasks

Clustering Deep Clustering Image Clustering

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CIFAR-10	MMDC	Accuracy	0.82	—	Unverified
CIFAR-100	MMDC	Accuracy	0.45	—	Unverified
ImageNet-10	MMDC	NMI	0.72	—	Unverified
STL-10	MMDC	Accuracy	0.69	—	Unverified
Tiny ImageNet	MMDC	Accuracy	0.12	—	Unverified

Multi-Modal Deep Clustering: Unsupervised Partitioning of Images

Code

Abstract

Tasks

Benchmark Results

Reproductions