Unsupervised Semantic Segmentation
Models that learn to segment each image (i.e. assign a class to every pixel) without seeing the ground truth labels.
( Image credit: SegSort: Segmentation by Discriminative Sorting of Segments )
Papers
Showing 1–10 of 95 papers
All datasetsCOCO-Stuff-27Cityscapes testPASCAL VOC 2012 valPotsdam-3COCO-Stuff-3ImageNet-S-50COCO-Stuff-171COCO-Stuff-81SUIMCOCO-Stuff-15ACDC (Adverse Conditions Dataset with Correspondences)Cityscapes val
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SGSeg | Clustering [Accuracy] | 55.7 | — | Unverified |
| 2 | DynaSeg - FSF (ResNet-18 FPN) | Clustering [mIoU] | 54.1 | — | Unverified |
| 3 | SAN | Clustering [Accuracy] | 52 | — | Unverified |
| 4 | DiffCut | Clustering [mIoU] | 49.1 | — | Unverified |
| 5 | PiCIE | Clustering [Accuracy] | 48.1 | — | Unverified |
| 6 | HCL (ViT-S/8) | Linear Classifier [mIoU] | 47.4 | — | Unverified |
| 7 | HCL (ViT-S/16) | Linear Classifier [mIoU] | 45.8 | — | Unverified |
| 8 | CAUSE (DINOv2, ViT-B/14) | Clustering [mIoU] | 45.3 | — | Unverified |
| 9 | Ours (SlotCon) | Clustering [Accuracy] | 42.36 | — | Unverified |
| 10 | CAUSE (ViT-B/8) | Clustering [mIoU] | 41.9 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GraPix | Pixel Accuracy | 64.89 | — | Unverified |
| 2 | CUPS | mIoU | 26.8 | — | Unverified |
| 3 | ViCE | mIoU | 25.2 | — | Unverified |
| 4 | EAGLE (DINO, ViT-B/8) | mIoU | 22.1 | — | Unverified |
| 5 | EQUSS | mIoU | 22 | — | Unverified |
| 6 | PriMaPs-EM + STEGO (DINO ViT-B/8) | mIoU | 21.6 | — | Unverified |
| 7 | STEGO | mIoU | 21 | — | Unverified |
| 8 | EAGLE (DINO, ViT-S/8) | mIoU | 19.7 | — | Unverified |
| 9 | PriMaPs-EM (DINO ViT-S/8) | mIoU | 19.4 | — | Unverified |
| 10 | HP | mIoU | 18.4 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CAUSE (iBOT, ViT-B/16) | Clustering [mIoU] | 53.4 | — | Unverified |
| 2 | CAUSE (ViT-B/8) | Clustering [mIoU] | 53.3 | — | Unverified |
| 3 | CAUSE (DINOv2, ViT-B/14) | Clustering [mIoU] | 53.2 | — | Unverified |
| 4 | MaskDistill+CRF | Clustering [mIoU] | 48.9 | — | Unverified |
| 5 | Leopart (ViT-B/8) | Clustering [mIoU] | 47.2 | — | Unverified |
| 6 | HCL (ViT-S/8) | Clustering [mIoU] | 46.3 | — | Unverified |
| 7 | MaskDistill | Clustering [mIoU] | 45.8 | — | Unverified |
| 8 | MaskContrast (Saliency) | Clustering [mIoU] | 44.2 | — | Unverified |
| 9 | HCL (ViT-S/16) | Clustering [mIoU] | 43.2 | — | Unverified |
| 10 | Leopart (ViT-S/16) | Clustering [mIoU] | 41.7 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PriMaPs-EM+HP (DINO ViT-B/8) | Accuracy | 83.3 | — | Unverified |
| 2 | EAGLE (DINO, ViT-B/8) | Accuracy | 83.3 | — | Unverified |
| 3 | HP | Accuracy | 82.4 | — | Unverified |
| 4 | EQUSS | Accuracy | 82 | — | Unverified |
| 5 | PriMaPs-EM (DINO ViT-B/8) | Accuracy | 80.5 | — | Unverified |
| 6 | STEGO | Accuracy | 77 | — | Unverified |
| 7 | InfoSeg | Pixel Accuracy | 71.6 | — | Unverified |
| 8 | IIC | Accuracy | 45.4 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PASS (+Saliency map) | mIoU (test) | 42.3 | — | Unverified |
| 2 | PASS | mIoU (test) | 32 | — | Unverified |
| 3 | MaskContrast (+Saliency map) | mIoU (test) | 24.2 | — | Unverified |
| 4 | PiCIE (Supervised pretrain) | mIoU (test) | 17.6 | — | Unverified |
| 5 | MDC (Supervised pretrain) | mIoU (test) | 14.3 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CAUSE-TR (ViT-S/8) | mIoU | 15.2 | — | Unverified |
| 2 | TransFGU (ViT-S/8) | mIoU | 11.93 | — | Unverified |
| 3 | PiCIE (ResNet-50) | mIoU | 5.6 | — | Unverified |
| 4 | IIC (ResNet-50) | mIoU | 2.2 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CAUSE-TR (ViT-S/8) | mIoU | 21.2 | — | Unverified |
| 2 | CAUSE-MLP (ViT-S/8) | mIoU | 19.1 | — | Unverified |
| 3 | TransFGU (ViT-S/8) | mIoU | 12.7 | — | Unverified |
| 4 | MaskContrast (ResNet-50) | mIoU | 3.7 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DatUS (ViT-B/8) + OC | Pixel Accuracy | 69.98 | — | Unverified |
| 2 | GraPix + AUT | Pixel Accuracy | 65.48 | — | Unverified |
| 3 | DatUS (ViT-B/8) | Pixel Accuracy | 64.67 | — | Unverified |
| 4 | GraPix | Pixel Accuracy | 64.06 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Segmenter ViT-S/16 | mIoU | 16.7 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Segmenter ViT-S/16 | mIoU | 21.8 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DenseSiam | mIoU | 16.4 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InfoSeg | Pixel Accuracy | 69.6 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Segmenter ViT-S/16 | mIoU | 14.2 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PASS | mIoU (test) | 11 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PASS | mIoU (test) | 18.1 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Segmenter ViT-S/16 | mIoU | 18.9 | — | Unverified |