SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 10511100 of 2262 papers

TitleStatusHype
Inverse Image Frequency for Long-tailed Image RecognitionCode0
Self-supervised Learning for Panoptic Segmentation of Multiple Fruit Flower SpeciesCode0
MassMIND: Massachusetts Maritime INfrared DatasetCode1
Exploring Target Representations for Masked AutoencodersCode0
SUNet: Scale-aware Unified Network for Panoptic Segmentation0
Automatic counting of mounds on UAV images: combining instance segmentation and patch-level correction0
MMV_Im2Im: An Open Source Microscopy Machine Vision Toolbox for Image-to-Image TransformationCode1
SIAN: Style-Guided Instance-Adaptive Normalization for Multi-Organ Histopathology Image Synthesis0
Adversarial Stain Transfer to Study the Effect of Color Variation on Cell Instance Segmentation0
MAFormer: A Transformer Network with Multi-scale Attention Fusion for Visual Recognition0
Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and BeyondCode1
Nuclei & Glands Instance Segmentation in Histology Images: A Narrative Review0
Refine and Represent: Region-to-Object Representation LearningCode1
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window0
Applying Eigencontours to PolarMask-Based Instance SegmentationCode1
Weakly Supervised Airway Orifice Segmentation in Video Bronchoscopy0
Fast and Precise Binary Instance Segmentation of 2D Objects for Automotive Applications0
InstanceFormer: An Online Video Instance Segmentation FrameworkCode1
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language TasksCode0
Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-tailed LearningCode0
Open-Vocabulary Universal Image Segmentation with MaskCLIPCode1
Unifying Visual Perception by Dispersible Points LearningCode1
Single-Stage Open-world Instance Segmentation with Cross-task Consistency RegularizationCode0
Video-TransUNet: Temporally Blended Vision Transformer for CT VFSS Instance SegmentationCode1
Look in Different Views: Multi-Scheme Regression Guided Cell Instance Segmentation0
DeepSportradar-v1: Computer Vision Dataset for Sports Understanding with High Quality AnnotationsCode1
FEC: Fast Euclidean Clustering for Point Cloud SegmentationCode2
Uni6Dv2: Noise Elimination for 6D Pose Estimation0
Scale-free and Task-agnostic Attack: Generating Photo-realistic Adversarial Patterns with Patch Quilting Generator0
Collaborative Propagation on Multiple Instance Graphs for 3D Instance Segmentation with Single-point SupervisionCode0
Learning to Complete Object Shapes for Object-level Mapping in Dynamic Scenes0
Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance SegmentationCode1
Occlusion-Aware Instance Segmentation via BiLayer Network ArchitecturesCode2
Instance As Identity: A Generic Online Paradigm for Video Instance SegmentationCode1
Image-based Detection of Surface Defects in Concrete during ConstructionCode0
MinVIS: A Minimal Video Instance Segmentation Framework without Video-based TrainingCode2
OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images0
Class-Difficulty Based Methods for Long-Tailed Visual RecognitionCode1
Video Mask Transfiner for High-Quality Video Instance SegmentationCode1
Training a universal instance segmentation network for live cell images of various cell types and imaging modalitiesCode0
Visual Recognition by RequestCode1
Object-ABN: Learning to Generate Sharp Attention Maps for Action Recognition0
Compositional Human-Scene Interaction Synthesis with Semantic ControlCode1
Point2Mask: A Weakly Supervised Approach for Cell Segmentation Using Point Annotation0
VizWiz-FewShot: Locating Objects in Images Taken by People With Visual Impairments0
Active Pointly-Supervised Instance SegmentationCode1
Long-tailed Instance Segmentation using Gumbel Optimized LossCode1
Geodesic-Former: a Geodesic-Guided Few-shot 3D Point Cloud Instance SegmenterCode1
Neural Groundplans: Persistent Neural Scene Representations from a Single Image0
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise BinarizationCode1
Show:102550
← PrevPage 22 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified