Causal Unsupervised Semantic Segmentation
Junho Kim, Byung-Kwan Lee, Yong Man Ro
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/ByungKwanLee/Causal-Unsupervised-SegmentationOfficialpytorch★ 11
Abstract
Unsupervised semantic segmentation aims to achieve high-quality semantic grouping without human-labeled annotations. With the advent of self-supervised pre-training, various frameworks utilize the pre-trained features to train prediction heads for unsupervised dense prediction. However, a significant challenge in this unsupervised setup is determining the appropriate level of clustering required for segmenting concepts. To address it, we propose a novel framework, CAusal Unsupervised Semantic sEgmentation (CAUSE), which leverages insights from causal inference. Specifically, we bridge intervention-oriented approach (i.e., frontdoor adjustment) to define suitable two-step tasks for unsupervised prediction. The first step involves constructing a concept clusterbook as a mediator, which represents possible concept prototypes at different levels of granularity in a discretized form. Then, the mediator establishes an explicit link to the subsequent concept-wise self-supervised learning for pixel-level grouping. Through extensive experiments and analyses on various datasets, we corroborate the effectiveness of CAUSE and achieve state-of-the-art performance in unsupervised semantic segmentation.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| COCO-Stuff-171 | CAUSE-TR (ViT-S/8) | mIoU | 15.2 | — | Unverified |
| COCO-Stuff-27 | CAUSE (DINOv2, ViT-B/14) | Clustering [mIoU] | 45.3 | — | Unverified |
| COCO-Stuff-27 | CAUSE (ViT-B/8) | Clustering [mIoU] | 41.9 | — | Unverified |
| COCO-Stuff-81 | CAUSE-TR (ViT-S/8) | mIoU | 21.2 | — | Unverified |
| COCO-Stuff-81 | CAUSE-MLP (ViT-S/8) | mIoU | 19.1 | — | Unverified |
| PASCAL VOC 2012 val | CAUSE (DINOv2, ViT-B/14) | Clustering [mIoU] | 53.2 | — | Unverified |
| PASCAL VOC 2012 val | CAUSE (ViT-B/8) | Clustering [mIoU] | 53.3 | — | Unverified |
| PASCAL VOC 2012 val | CAUSE (iBOT, ViT-B/16) | Clustering [mIoU] | 53.4 | — | Unverified |