SOTAVerified

MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation

2024-01-09Code Available0· sign in to hype

Long Xu, Shanghong Li, Yongquan Chen, Jun Luo, Shiwu Lai

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Interactive segmentation has gained significant attention for its application in human-computer interaction and data annotation. To address the target scale variation issue in interactive segmentation, a novel multi-scale token adaptation algorithm is proposed. By performing top-k operations across multi-scale tokens, the computational complexity is greatly simplified while ensuring performance. To enhance the robustness of multi-scale token selection, we also propose a token learning algorithm based on contrastive loss. This algorithm can effectively improve the performance of multi-scale token adaptation. Extensive benchmarking shows that the algorithm achieves state-of-the-art (SOTA) performance, compared to current methods. An interactive demo and all reproducible codes will be released at https://github.com/hahamyt/mst.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
BerkeleyViT-B+MST+CLNoC@901.5Unverified
COCO minivalViT-B+MST+CLNoC@852.08Unverified
DAVISViT-B+MST+CLNoC@904.55Unverified
DAVIS-585ViT-B+MST+CLNoC@902.29Unverified
GrabCutViT-B+MST+CLNoC@901.48Unverified
PascalVOCViT-B+MST+CLNoC@851.69Unverified
SBDViT-B+MST+CLNoC@853.03Unverified

Reproductions