SOTAVerified

Exploring the Limits of Deep Image Clustering using Pretrained Models

2023-03-31Code Available1· sign in to hype

Nikolas Adaloglou, Felix Michels, Hamza Kalisch, Markus Kollmann

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We present a general methodology that learns to classify images without labels by leveraging pretrained feature extractors. Our approach involves self-distillation training of clustering heads based on the fact that nearest neighbours in the pretrained feature space are likely to share the same label. We propose a novel objective that learns associations between image features by introducing a variant of pointwise mutual information together with instance weighting. We demonstrate that the proposed objective is able to attenuate the effect of false positive pairs while efficiently exploiting the structure in the pretrained feature space. As a result, we improve the clustering accuracy over k-means on 17 different pretrained models by 6.1\% and 12.2\% on ImageNet and CIFAR100, respectively. Finally, using self-supervised vision transformers, we achieve a clustering accuracy of 61.6\% on ImageNet. The code is available at https://github.com/HHU-MMBS/TEMI-official-BMVC2023.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
CIFAR-10TEMI CLIP ViT-L (openai)Accuracy0.97Unverified
CIFAR-10TEMI DINO ViT-BNMI0.89Unverified
CIFAR-100TEMI DINO ViT-BAccuracy0.67Unverified
CIFAR-100TEMI CLIP ViT-L (openai)Accuracy0.74Unverified
ImageNetTEMI DINO (ViT-B)Accuracy58Unverified
ImageNetTEMI MSN (ViT-L)Accuracy61.6Unverified
ImageNet-100 (TEMI Split)TEMI CLIP ViT-L (openai)NMI0.9Unverified
ImageNet-100 (TEMI Split)TEMI MSN ViT-LNMI0.89Unverified
ImageNet-100 (TEMI Split)TEMI DINO ViT-BNMI0.86Unverified
ImageNet-200TEMI DINO ViT-BNMI0.85Unverified
ImageNet-200TEMI CLIP ViT-L (openai)NMI0.88Unverified
ImageNet-200TEMI MSN ViT-LNMI0.87Unverified
ImageNet-50 (TEMI Split)TEMI DINO ViT-BNMI0.86Unverified
ImageNet-50 (TEMI Split)TEMI CLIP ViT-L (openai)NMI0.92Unverified
ImageNet-50 (TEMI Split)TEMI MSN ViT-LNMI0.88Unverified
STL-10TEMI DINO ViT-BAccuracy0.99Unverified

Reproductions