SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 18511900 of 2262 papers

TitleStatusHype
UNIT: Unsupervised Online Instance Segmentation through Time0
UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes0
Unseen Object Instance Segmentation with Fully Test-time RGB-D Embeddings Adaptation0
Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision0
Unsupervised Pre-Training for 3D Leaf Instance Segmentation0
Unsupervised Spiking Instance Segmentation on Event Data using STDP0
Unsupervised Video Object Segmentation with Distractor-Aware Online Adaptation0
CamoFA: A Learnable Fourier-based Augmentation for Camouflage Segmentation0
UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation0
Using depth information and colour space variations for improving outdoor robustness for instance segmentation of cabbage0
Using t-distributed stochastic neighbor embedding for visualization and segmentation of 3D point clouds of plants0
US-net for robust and efficient nuclei instance segmentation0
UVIS: Unsupervised Video Instance Segmentation0
Vehicle Instance Segmentation from Aerial Image and Video Using a Multi-Task Learning Residual Fully Convolutional Network0
Vehicle Occurrence-based Parking Space Detection0
VertDetect: Fully End-to-End 3D Vertebral Instance Segmentation Model0
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer0
Video Instance Segmentation by Instance Flow Assembly0
Video Instance Segmentation Tracking With a Modified VAE Architecture0
Video Prediction Models as General Visual Encoders0
Virtual KITTI 20
Virtual Worlds as Proxy for Multi-Object Tracking Analysis0
Vision Aided Channel Prediction for Vehicular Communications: A Case Study of Received Power Prediction Using RGB Images0
Vision Foundation Model Embedding-Based Semantic Anomaly Detection0
Vision Transformers Are Good Mask Auto-Labelers0
Visual Relationship Prediction via Label Clustering and Incorporation of Depth Information0
VizWiz-FewShot: Locating Objects in Images Taken by People With Visual Impairments0
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant0
Volume and leaf area calculation of cabbage with a neural network-based instance segmentation0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
VoxelEmbed: 3D Instance Segmentation and Tracking with Voxel Embedding based Deep Learning0
Weakly Supervised 3D Instance Segmentation without Instance-level Annotations0
Weakly Supervised Airway Orifice Segmentation in Video Bronchoscopy0
Weakly Supervised Instance Segmentation by Deep Community Learning0
Weakly Supervised Instance Segmentation by Learning Annotation Consistent Instances0
Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency0
Weakly Supervised Instance Segmentation Using Hybrid Network0
Weakly Supervised Instance Segmentation using Motion Information via Optical Flow0
Salient Instance Segmentation with Region and Box-level Annotations0
Weakly Supervised Multi-Object Tracking and Segmentation0
Weakly-Supervised Text Instance Segmentation0
WeakSurg: Weakly supervised surgical instrument segmentation using temporal equivariance and semantic continuity0
What is Point Supervision Worth in Video Instance Segmentation?0
What Makes for Good Views for Contrastive Learning?0
What Makes for Hierarchical Vision Transformer?0
What's in my Room? Object Recognition on Indoor Panoramic Images0
When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision0
WildDash - Creating Hazard-Aware Benchmarks0
WISH: Weakly Supervised Instance Segmentation using Heterogeneous Labels0
X^3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection0
Show:102550
← PrevPage 38 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified