Open-Vocabulary Segmentation with Semantic-Assisted Calibration

2023-12-07CVPR 2024Code Available1· sign in to hype

Yong liu, Sule Bai, Guanbin Li, Yitong Wang, Yansong Tang

Code Available — Be the first to reproduce this paper.

Code

github.com/workforai/scan
OfficialIn paperpytorch★ 77
github.com/yongliu20/SCAN
Officialpytorch★ 77

Abstract

This paper studies open-vocabulary segmentation (OVS) through calibrating in-vocabulary and domain-biased embedding space with generalized contextual prior of CLIP. As the core of open-vocabulary understanding, alignment of visual content with the semantics of unbounded text has become the bottleneck of this field. To address this challenge, recent works propose to utilize CLIP as an additional classifier and aggregate model predictions with CLIP classification results. Despite their remarkable progress, performance of OVS methods in relevant scenarios is still unsatisfactory compared with supervised counterparts. We attribute this to the in-vocabulary embedding and domain-biased CLIP prediction. To this end, we present a Semantic-assisted CAlibration Network (SCAN). In SCAN, we incorporate generalized semantic prior of CLIP into proposal embedding to avoid collapsing on known categories. Besides, a contextual shift strategy is applied to mitigate the lack of global context and unnatural background noise. With above designs, SCAN achieves state-of-the-art performance on all popular open-vocabulary segmentation benchmarks. Furthermore, we also focus on the problem of existing evaluation system that ignores semantic duplication across categories, and propose a new metric called Semantic-Guided IoU (SG-IoU).

Tasks

Attribute Open Vocabulary Semantic Segmentation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ADE20K-150	SCAN	mIoU	33.5	—	Unverified
ADE20K-847	SCAN	mIoU	14	—	Unverified
PASCAL Context-459	SCAN	mIoU	16.7	—	Unverified
PASCAL Context-59	SCAN	mIoU	59.3	—	Unverified
PascalVOC-20	SCAN	mIoU	97.2	—	Unverified

Open-Vocabulary Segmentation with Semantic-Assisted Calibration

Code

Abstract

Tasks

Benchmark Results

Reproductions