SOTAVerified

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

2023-02-14CVPR 2023Code Available1· sign in to hype

Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

In this work, instead of directly predicting the pixel-level segmentation masks, the problem of referring image segmentation is formulated as sequential polygon generation, and the predicted polygons can be later converted into segmentation masks. This is enabled by a new sequence-to-sequence framework, Polygon Transformer (PolyFormer), which takes a sequence of image patches and text query tokens as input, and outputs a sequence of polygon vertices autoregressively. For more accurate geometric localization, we propose a regression-based decoder, which predicts the precise floating-point coordinates directly, without any coordinate quantization error. In the experiments, PolyFormer outperforms the prior art by a clear margin, e.g., 5.40% and 4.52% absolute improvements on the challenging RefCOCO+ and RefCOCOg datasets. It also shows strong generalization ability when evaluated on the referring video segmentation task without fine-tuning, e.g., achieving competitive 61.5% J&F on the Ref-DAVIS17 dataset.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
DAVIS 2017 (val)PolyFormer-BJ&F 1st frame60.9Unverified
RefCOCOg-testPolyFormer-LOverall IoU70.19Unverified
RefCOCOg-testPolyFormer-BOverall IoU69.05Unverified
RefCOCOg-valPolyFormer-LOverall IoU69.2Unverified
RefCOCOg-valPolyFormer-BOverall IoU67.76Unverified
RefCOCO testAPolyFormer-BOverall IoU72.89Unverified
RefCOCO testAPolyFormer-LOverall IoU74.56Unverified
RefCOCO+ test BPolyFormer-LOverall IoU61.87Unverified
RefCOCO+ test BPolyFormer-BOverall IoU59.33Unverified
RefCoCo valPolyFormer-BOverall IoU67.64Unverified
RefCoCo valPolyFormer-BOverall IoU74.82Unverified
RefCoCo valPolyFormer-LOverall IoU75.96Unverified
RefCoCo valPolyFormer-LOverall IoU69.33Unverified
ReferItPolyFormer-LOverall IoU72.6Unverified
ReferItPolyFormer-BOverall IoU71.91Unverified

Reproductions