SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 151175 of 2262 papers

TitleStatusHype
Decoupled Motion Expression Video Segmentation0
Semantic and Sequential Alignment for Referring Video Object Segmentation0
Insightful Instance Features for 3D Instance Segmentation0
DefMamba: Deformable Visual State Space Model0
PolarNeXt: Rethink Instance Segmentation with Polar Representation0
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAMCode0
A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images0
Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers0
RelationField: Relate Anything in Radiance FieldsCode2
ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance SegmentationCode1
PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery DocumentationCode0
SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation0
Classification Drives Geographic Bias in Street Scene Segmentation0
RapidNet: Multi-Level Dilated Convolution Based Mobile BackboneCode1
STEAM: Squeeze and Transform Enhanced Attention Module0
MaskTerial: A Foundation Model for Automated 2D Material Flake DetectionCode2
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework0
Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards0
DreamColour: Controllable Video Colour Editing without TrainingCode2
Towards Real-Time Open-Vocabulary Video Instance SegmentationCode0
Vision Transformers for Weakly-Supervised Microorganism EnumerationCode0
A2VIS: Amodal-Aware Approach to Video Instance Segmentation0
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting0
Holistic Understanding of 3D Scenes as Universal Scene Description0
Token Cropr: Faster ViTs for Quite a Few TasksCode1
Show:102550
← PrevPage 7 of 91Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified