SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 10011050 of 2262 papers

TitleStatusHype
Could Giant Pretrained Image Models Extract Universal Representations?0
Deep Learning based Defect classification and detection in SEM images: A Mask R-CNN approach0
Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks0
CircleSnake: Instance Segmentation with Circle RepresentationCode0
Two-Level Temporal Relation Model for Online Video Instance SegmentationCode0
Grafting Vision Transformers0
Layout Aware Inpainting for Automated Furniture Removal in Indoor Scenes0
Instance Segmentation for Chinese Character Stroke Extraction, Datasets and BenchmarksCode1
BARS: A Benchmark for Airport Runway SegmentationCode1
Self-Supervised Learning with Masked Image Modeling for Teeth Numbering, Detection of Dental Restorations, and Instance Segmentation in Dental Panoramic RadiographsCode1
MGTUNet: An new UNet for colon nuclei instance segmentation and quantification0
Cell tracking for live-cell microscopy using an activity-prioritized assignment strategyCode0
Large-batch Optimization for Dense Visual PredictionsCode1
Self-Supervised Learning via Maximum Entropy CodingCode1
Comparative analysis of deep learning approaches for AgNOR-stained cytology samples interpretation0
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun DistillationCode1
A Tri-Layer Plugin to Improve Occluded DetectionCode1
Scrape, Cut, Paste and Learn: Automated Dataset Generation Applied to Parcel LogisticsCode1
TIVE: A Toolbox for Identifying Video Instance Segmentation ErrorsCode1
Deformably-Scaled Transposed Convolution0
Instance Segmentation with Cross-Modal Consistency0
Hierarchical Approach for Joint Semantic, Plant Instance, and Leaf Instance Segmentation in the Agricultural DomainCode1
H2RBox: Horizontal Box Annotation is All You Need for Oriented Object DetectionCode1
AISFormer: Amodal Instance Segmentation with TransformerCode1
Latency-aware Spatial-wise Dynamic NetworksCode1
Learning Inter-Superpoint Affinity for Weakly Supervised 3D Instance SegmentationCode1
BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance SegmentationCode1
4D Unsupervised Object DiscoveryCode1
What the DAAM: Interpreting Stable Diffusion Using Cross AttentionCode2
OGC: Unsupervised 3D Object Segmentation from Rigid Dynamics of Point CloudsCode1
Instance Segmentation of Dense and Overlapping Objects via LayeringCode1
Time-Space Transformers for Video Panoptic Segmentation0
Humans need not label more humans: Occlusion Copy & Paste for Occluded Human Instance SegmentationCode1
Mask3D: Mask Transformer for 3D Semantic Instance SegmentationCode2
Effective Self-supervised Pre-training on Low-compute Networks without DistillationCode1
Exploring The Role of Mean Teachers in Self-supervised Masked Auto-EncodersCode1
Domain Adaptation for Unknown Image Distortions in Instance Segmentation0
K-means for unsupervised instance segmentation using a self-supervised transformer0
MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision ModelsCode1
Learning Equivariant Segmentation with Instance-Unique QueryingCode1
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningCode1
Dilated Neighborhood Attention TransformerCode2
Strong Instance Segmentation Pipeline for MMSports ChallengeCode1
Diversified Dynamic Routing for Vision Tasks0
D-InLoc++: Indoor Localization in Dynamic EnvironmentsCode0
RNGDet++: Road Network Graph Detection by Transformer with Instance Segmentation and Multi-scale Features Enhancement0
A Dataset for Analysing Complex Document Layouts in the Digital Humanities and Its Evaluation with Krippendorff’s AlphaCode0
SOCRATES: A Stereo Camera Trap for Monitoring of BiodiversityCode0
Scalable SoftGroup for 3D Instance Segmentation on Point CloudsCode2
Segmenting Known Objects and Unseen Unknowns without Prior KnowledgeCode1
Show:102550
← PrevPage 21 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8GLEE-Promask AP54.2Unverified
9ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified