DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision

2021-05-13 · ICCV 2021 · Code Available

Shiyi Lan, Zhiding Yu, Christopher Choy, Subhashree Radhakrishnan, Guilin Liu, Yuke Zhu, Larry S. Davis, Anima Anandkumar

Abstract

We introduce DiscoBox, a novel framework that jointly learns instance segmentation and semantic correspondence using bounding box supervision. Specifically, we propose a self-ensembling framework where instance segmentation and semantic correspondence are jointly guided by a structured teacher in addition to the bounding box supervision. The teacher is a structured energy model incorporating a pairwise potential and a cross-image potential to model the pairwise pixel relationships both within and across the boxes. Minimizing the teacher energy simultaneously yields refined object masks and dense correspondences between intra-class objects, which are taken as pseudo-labels to supervise the task network and provide positive/negative correspondence pairs for dense contrastive learning. We show a symbiotic relationship where the two tasks mutually benefit from each other. Our best model achieves 37.9% AP on COCO instance segmentation, surpassing prior weakly supervised methods and remaining competitive with supervised methods. We also obtain state-of-the-art weakly supervised results on PASCAL VOC12 and PF-PASCAL with real-time inference.
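
To make the phrase "positive/negative correspondence pairs for dense contrastive learning" concrete, the following is a minimal sketch of a generic InfoNCE-style loss for a single anchor pixel feature. It is not taken from the DiscoBox code and may differ from the loss the authors use; the function name `info_nce`, the `temperature` value, and the use of cosine similarity are all assumptions made for illustration.

```python
import math


def dot(u, v):
    """Dot product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    """Scale a feature vector to unit L2 norm."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def info_nce(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss for one anchor feature (illustrative only).

    anchor:    feature of a pixel in one image
    positive:  feature of its matched pixel (a positive correspondence)
    negatives: features of non-matching pixels (negative correspondences)
    temperature: hypothetical value, not taken from the paper
    """
    a = normalize(anchor)
    # Cosine similarities scaled by temperature; the positive sits at index 0.
    logits = [dot(a, normalize(positive)) / temperature]
    logits += [dot(a, normalize(n)) / temperature for n in negatives]
    # Numerically stable log-sum-exp over all candidates.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    # Negative log-probability of picking the positive among all candidates.
    return log_sum - logits[0]


if __name__ == "__main__":
    well_matched = info_nce([1.0, 0.0], [2.0, 0.0], [[0.0, 1.0], [-1.0, 0.0]])
    mismatched = info_nce([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
    print(well_matched, mismatched)
```

The loss is small when the anchor is closest to its positive and large when a negative is closer, which is the behavior a contrastive objective over correspondence pairs relies on. In a dense setting, a loss of this kind would be averaged over many anchor pixels.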

Benchmark Results

Dataset       | Model                          | Metric | Claimed | Verified | Status
COCO 2017 val | DiscoBox (ResNet-50)           | AP     | 31.4    | -        | Unverified
COCO test-dev | DiscoBox (ResNeXt-101-DCN-FPN) | AP     | 37.9    | -        | Unverified
COCO test-dev | DiscoBox (ResNet-101-DCN-FPN)  | AP     | 35.8    | -        | Unverified
COCO test-dev | DiscoBox (ResNet-50-FPN)       | AP     | 32      | -        | Unverified