Stepwise Feature Fusion: Local Guides Global

2022-03-07Code Available1· sign in to hype

Jinfeng Wang, Qiming Huang, Feilong Tang, Jia Meng, Jionglong Su, Sifan Song

Code Available — Be the first to reproduce this paper.

Code

github.com/Qiming-Huang/ssformer
OfficialIn paperpytorch★ 60

Abstract

Colonoscopy, currently the most efficient and recognized colon polyp detection technology, is necessary for early screening and prevention of colorectal cancer. However, due to the varying size and complex morphological features of colonic polyps as well as the indistinct boundary between polyps and mucosa, accurate segmentation of polyps is still challenging. Deep learning has become popular for accurate polyp segmentation tasks with excellent results. However, due to the structure of polyps image and the varying shapes of polyps, it easy for existing deep learning models to overfitting the current dataset. As a result, the model may not process unseen colonoscopy data. To address this, we propose a new State-Of-The-Art model for medical image segmentation, the SSFormer, which uses a pyramid Transformer encoder to improve the generalization ability of models. Specifically, our proposed Progressive Locality Decoder can be adapted to the pyramid Transformer backbone to emphasize local features and restrict attention dispersion. The SSFormer achieves statet-of-the-art performance in both learning and generalization assessment.

Tasks

Decoder Image Segmentation Medical Image Segmentation Segmentation Semantic Segmentation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
2018 Data Science Bowl	SSFormer-L	Dice	0.92	—	Unverified
CVC-ClinicDB	SSFormer-L	mean Dice	0.94	—	Unverified
CVC-ColonDB	SSFormer-L	mean Dice	0.8	—	Unverified
ETIS-LARIBPOLYPDB	SSFormer-L	mean Dice	0.8	—	Unverified
Kvasir-SEG	SSFormer-L	mean Dice	0.94	—	Unverified

Stepwise Feature Fusion: Local Guides Global

Code

Abstract

Tasks

Benchmark Results

Reproductions