3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW

2023-06-04Code Available1· sign in to hype

Shijie Chang, Zeqi Hao, Ben Kang, Xiaoqi Zhao, Jiawen Zhu, Zhenyu Chen, Lihe Zhang, Lu Zhang, Huchuan Lu

Code Available — Be the first to reproduce this paper.

Code

github.com/dut-csj/pvuw2023-vss-3rd
OfficialIn papernone★ 10

Abstract

In this paper, we introduce 3rd place solution for PVUW2023 VSS track. Semantic segmentation is a fundamental task in computer vision with numerous real-world applications. We have explored various image-level visual backbones and segmentation heads to tackle the problem of video semantic segmentation. Through our experimentation, we find that InternImage-H as the backbone and Mask2former as the segmentation head achieves the best performance. In addition, we explore two post-precessing methods: CascadePSP and Segment Anything Model (SAM). Ultimately, our approach obtains 62.60\% and 64.84\% mIoU on the VSPW test set1 and final test set, respectively, securing the third position in the PVUW2023 VSS track.

Tasks

Position Segmentation Semantic Segmentation Video Semantic Segmentation

3rd Place Solution for PVUW2023 VSS Track: A Large Model for Semantic Segmentation on VSPW

Code

Abstract

Tasks

Reproductions