SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 18511875 of 2262 papers

TitleStatusHype
UNIT: Unsupervised Online Instance Segmentation through Time0
UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes0
Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation0
Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision0
Unsupervised Pre-Training for 3D Leaf Instance Segmentation0
Unsupervised Spiking Instance Segmentation on Event Data using STDP0
Unsupervised Video Object Segmentation with Distractor-Aware Online Adaptation0
CamoFA: A Learnable Fourier-based Augmentation for Camouflage Segmentation0
UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation0
Using depth information and colour space variations for improving outdoor robustness for instance segmentation of cabbage0
Using t-distributed stochastic neighbor embedding for visualization and segmentation of 3D point clouds of plants0
US-net for robust and efficient nuclei instance segmentation0
UVIS: Unsupervised Video Instance Segmentation0
Vehicle Instance Segmentation from Aerial Image and Video Using a Multi-Task Learning Residual Fully Convolutional Network0
Vehicle Occurrence-based Parking Space Detection0
VertDetect: Fully End-to-End 3D Vertebral Instance Segmentation Model0
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer0
Video Instance Segmentation by Instance Flow Assembly0
Video Instance Segmentation Tracking With a Modified VAE Architecture0
Video Prediction Models as General Visual Encoders0
Virtual KITTI 20
Virtual Worlds as Proxy for Multi-Object Tracking Analysis0
Vision Aided Channel Prediction for Vehicular Communications: A Case Study of Received Power Prediction Using RGB Images0
Vision Foundation Model Embedding-Based Semantic Anomaly Detection0
Vision Transformers Are Good Mask Auto-Labelers0
Show:102550
← PrevPage 75 of 91Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified