SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that identifies and separates individual objects within an image: it detects the boundary of each object and assigns each one a unique label. The goal is to produce a pixel-wise segmentation map of the image, in which each pixel is assigned to a specific object instance.
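As a rough illustration of the pixel-wise map described above, the sketch below stores one integer instance ID per pixel and splits that map into one binary mask per object. It is a toy example in plain Python, not taken from any listed paper: the names `split_instances` and `bounding_box`, the use of `0` for background, and the 4x5 grid are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

Grid = List[List[int]]


def split_instances(instance_map: Grid, background: int = 0) -> Dict[int, Grid]:
    """Turn a pixel-wise instance-ID map into one binary mask per instance."""
    height = len(instance_map)
    width = len(instance_map[0]) if height else 0
    masks: Dict[int, Grid] = {}
    for y, row in enumerate(instance_map):
        for x, instance_id in enumerate(row):
            if instance_id == background:
                continue  # background pixels belong to no instance (assumed convention)
            # Create an all-zero mask the first time an instance ID is seen.
            if instance_id not in masks:
                masks[instance_id] = [[0] * width for _ in range(height)]
            masks[instance_id][y][x] = 1
    return masks


def bounding_box(mask: Grid) -> Tuple[int, int, int, int]:
    """Return (x_min, y_min, x_max, y_max), inclusive, of the nonzero pixels."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, value in enumerate(row) if value]
    return (min(xs), min(ys), max(xs), max(ys))


# Hypothetical 4x5 image containing two object instances (IDs 1 and 2).
instance_map = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 2, 2],
    [0, 0, 0, 2, 2],
]

masks = split_instances(instance_map)
for instance_id in sorted(masks):
    print(instance_id, bounding_box(masks[instance_id]))
```

The IDs only distinguish one object from another. Which class each object belongs to would have to be stored separately, for example in a mapping from instance ID to class name.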

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 1051-1100 of 2262 papers

Title | Status | Hype
Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection |  | 0
Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data |  | 0
Generative AI Driven Task-Oriented Adaptive Semantic Communications |  | 0
Mask-Guided Matting in the Wild |  | 0
MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features |  | 0
Generalized Object Detection on Fisheye Cameras for Autonomous Driving: Dataset, Representations and Baseline |  | 0
Generalized Mask-aware IoU for Anchor Assignment for Real-time Instance Segmentation |  | 0
3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation |  | 0
Mask Frozen-DETR: High Quality Instance Segmentation with One GPU |  | 0
Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images |  | 0
Generalized Class Discovery in Instance Segmentation |  | 0
gen2seg: Generative Models Enable Generalizable Instance Segmentation |  | 0
CNN-based Preprocessing to Optimize Watershed-based Cell Segmentation in 3D Confocal Microscopy Images |  | 0
ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers |  | 0
MaskGroup: Hierarchical Point Grouping and Masking for 3D Instance Segmentation |  | 0
MaskPlus: Improving Mask Generation for Instance Segmentation |  | 0
CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation |  | 0
Associating Inter-Image Salient Instances for Weakly Supervised Semantic Segmentation |  | 0
GANtruth - an unpaired image-to-image translation method for driving scenarios |  | 0
Cluttered Food Grasping with Adaptive Fingers and Synthetic-Data Trained Object Detection |  | 0
3D-BEVIS: Bird's-Eye-View Instance Segmentation |  | 0
ClusterViG: Efficient Globally Aware Vision GNNs via Image Partitioning |  | 0
Mask Encoding for Single Shot Instance Segmentation |  | 0
ClusterNet: 3D Instance Segmentation in RGB-D Images |  | 0
Fully-Automated Packaging Structure Recognition in Logistics Environments |  | 0
FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels |  | 0
Assessing YOLACT++ for real time and robust instance segmentation of medical instruments in endoscopic procedures |  | 0
Fruit Detection, Segmentation and 3D Visualisation of Environments in Apple Orchards |  | 0
Assessing SAM for Tree Crown Instance Segmentation from Drone Imagery |  | 0
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments |  | 0
A Dataset for Lane Instance Segmentation in Urban Environments |  | 0
3D-Aware Instance Segmentation and Tracking in Egocentric Videos |  | 0
Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations |  | 0
Joint Object Contour Points and Semantics for Instance Segmentation |  | 0
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation |  | 0
Free Supervision From Video Games |  | 0
CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds |  | 0
MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation |  | 0
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation |  | 0
Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation |  | 0
FPCD: An Open Aerial VHR Dataset for Farm Pond Change Detection |  | 0
RevealNet: Seeing Behind Objects in RGB-D Scans |  | 0
FoxInst: A Frustratingly Simple Baseline for Weakly Few-shot Instance Segmentation |  | 0
ASIST: Annotation-free synthetic instance segmentation and tracking for microscope video analysis |  | 0
3D Segmentation of Humans in Point Clouds with Synthetic Data |  | 0
FourierMask: Instance Segmentation using Fourier Mapping in Implicit Neural Networks |  | 0
ClickSeg: 3D Instance Segmentation with Click-Level Weak Annotations |  | 0
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors |  | 0
FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees |  | 0
A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing |  | 0
Page 22 of 46

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H | AP50 | 80.8 |  | Unverified
2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 |  | Unverified
3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 |  | Unverified
4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 |  | Unverified
5 | Co-DETR | mask AP | 57.1 |  | Unverified
6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 |  | Unverified
7 | ISDA (ResNet-50) | APL | 55.7 |  | Unverified
8 | EVA | mask AP | 55.5 |  | Unverified
9 | FD-SwinV2-G | mask AP | 55.4 |  | Unverified
10 | Mask Frozen-DETR | mask AP | 55.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-B | GFLOPs | 501 |  | Unverified
2 | Co-DETR | mask AP | 56.6 |  | Unverified
3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 |  | Unverified
4 | InternImage-H | mask AP | 55.4 |  | Unverified
5 | EVA | mask AP | 55 |  | Unverified
6 | Mask Frozen-DETR | mask AP | 54.9 |  | Unverified
7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 |  | Unverified
8 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 |  | Unverified
9 | GLEE-Pro | mask AP | 54.2 |  | Unverified
10 | SwinV2-G (HTC++) | mask AP | 53.7 |  | Unverified