SOTAVerified

Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation

2022-07-30Code Available1· sign in to hype

Zhitong Xiong, Haopeng Li, Xiao Xiang Zhu

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Training semantic segmentation models with few annotated samples has great potential in various real-world applications. For the few-shot segmentation task, the main challenge is how to accurately measure the semantic correspondence between the support and query samples with limited training data. To address this problem, we propose to aggregate the learnable covariance matrices with a deformable 4D Transformer to effectively predict the segmentation map. Specifically, in this work, we first devise a novel hard example mining mechanism to learn covariance kernels for the Gaussian process. The learned covariance kernel functions have great advantages over existing cosine similarity-based methods in correspondence measurement. Based on the learned covariance kernels, an efficient doubly deformable 4D Transformer module is designed to adaptively aggregate feature similarity maps into segmentation results. By combining these two designs, the proposed method can not only set new state-of-the-art performance on public benchmarks, but also converge extremely faster than existing methods. Experiments on three public datasets have demonstrated the effectiveness of our method.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
COCO-20i (1-shot)DACM (VAT, ResNet-50)Mean IoU43Unverified
COCO-20i (1-shot)DACM (ResNet-50)Mean IoU40.6Unverified
COCO-20i (5-shot)DACM (VAT, ResNet-50)Mean IoU49.2Unverified
COCO-20i (5-shot)DACM (ResNet-50)Mean IoU48.1Unverified
FSS-1000 (1-shot)DACM (ResNet-50)Mean IoU90.7Unverified
FSS-1000 (1-shot)DACM (ResNet-101)Mean IoU90.8Unverified
FSS-1000 (5-shot)DACM (ResNet-101)Mean IoU91.7Unverified
FSS-1000 (5-shot)DACM (ResNet-50)Mean IoU91.6Unverified
PASCAL-5i (1-Shot)DACM (VGG-16)Mean IoU61.8Unverified
PASCAL-5i (1-Shot)DACM (VAT, ResNet-101)Mean IoU69.1Unverified
PASCAL-5i (1-Shot)DACM (ResNet-101)Mean IoU67.5Unverified
PASCAL-5i (1-Shot)DACM (VAT, ResNet-50)Mean IoU66.8Unverified
PASCAL-5i (1-Shot)DACM (ResNet-50)Mean IoU65.7Unverified
PASCAL-5i (5-Shot)DACM (VAT, ResNet-101)Mean IoU73.3Unverified
PASCAL-5i (5-Shot)DACM (VAT, ResNet-50)Mean IoU71.7Unverified
PASCAL-5i (5-Shot)DACM (ResNet-101)Mean IoU71.4Unverified
PASCAL-5i (5-Shot)DACM (ResNet-50)Mean IoU70.9Unverified
PASCAL-5i (5-Shot)DACM (VGG-16)Mean IoU65.7Unverified

Reproductions