SOTAVerified

Instance Segmentation

Instance segmentation is a computer vision task that identifies and separates the individual objects in an image: it detects the boundary of each object and assigns each object a unique label. The goal is a pixel-wise segmentation map in which every object pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21
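
To make the idea of a pixel-wise instance map concrete, here is a minimal sketch in plain Python. It is an illustration only, not taken from any of the listed papers. It assumes a convention that is common but not stated on this page: the map stores one integer per pixel, with 0 for background and each positive integer naming one object instance. The helper names `split_instances` and `bounding_box` are hypothetical.

```python
def split_instances(instance_map):
    """Group pixel coordinates by instance ID.

    Assumption: 0 marks background; every other integer is one instance.
    Returns a dict mapping instance ID -> set of (x, y) pixel coordinates.
    """
    masks = {}
    for y, row in enumerate(instance_map):
        for x, inst_id in enumerate(row):
            if inst_id == 0:
                continue  # background pixels belong to no instance
            masks.setdefault(inst_id, set()).add((x, y))
    return masks


def bounding_box(pixels):
    """Return the tight box (x_min, y_min, x_max, y_max) around a pixel set."""
    xs = [x for x, _ in pixels]
    ys = [y for _, y in pixels]
    return min(xs), min(ys), max(xs), max(ys)


# Toy 5x6 image containing two separate objects (IDs 1 and 2).
instance_map = [
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 2, 2],
    [0, 0, 0, 0, 2, 2],
    [0, 0, 0, 0, 2, 2],
    [0, 0, 0, 0, 0, 0],
]

masks = split_instances(instance_map)
boxes = {inst_id: bounding_box(pixels) for inst_id, pixels in masks.items()}

for inst_id in sorted(masks):
    print(inst_id, len(masks[inst_id]), boxes[inst_id])
```

Each instance ends up with its own mask and its own box, even if two instances were to share a class. That per-object separation is what distinguishes instance segmentation from assigning only a class label to each pixel.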

Papers

Showing 651-700 of 2262 papers

Title | Status | Hype
Improving Convolutional Networks With Self-Calibrated Convolutions | Code | 1
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation | Code | 1
MPViT: Multi-Path Vision Transformer for Dense Prediction | Code | 1
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation | Code | 1
Incremental Few-Shot Instance Segmentation | Code | 1
Deep learning approaches to building rooftop thermal bridge detection from aerial images | Code | 1
Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset | Code | 1
Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function | Code | 1
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics | Code | 1
NDD20: A large-scale few-shot dolphin dataset for coarse and fine-grained categorisation | Code | 1
Human Instance Matting via Mutual Guidance and Multi-Instance Refinement | Code | 1
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments | Code | 1
UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place Solution | Code | 1
EfficientPS: Efficient Panoptic Segmentation | Code | 1
Humans need not label more humans: Occlusion Copy & Paste for Occluded Human Instance Segmentation | Code | 1
HoVer-UNet: Accelerating HoVerNet with UNet-based multi-class nuclei segmentation via knowledge distillation | Code | 1
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation | Code | 1
Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images | Code | 1
NucMM Dataset: 3D Neuronal Nuclei Instance Segmentation at Sub-Cubic Millimeter Scale | Code | 1
HoughNet: Integrating near and long-range evidence for visual detection | Code | 1
HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network | Code | 1
Hybrid Task Cascade for Instance Segmentation | Code | 1
Deep High-Resolution Representation Learning for Human Pose Estimation | Code | 1
BARS: A Benchmark for Airport Runway Segmentation | Code | 1
HistoSmith: Single-Stage Histology Image-Label Generation via Conditional Latent Diffusion for Enhanced Cell Segmentation and Classification | Code | 1
OG-HFYOLO: Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentation | Code | 1
ElC-OIS: Ellipsoidal Clustering for Open-World Instance Segmentation on LiDAR Data | Code | 1
ELSA: Enhanced Local Self-Attention for Vision Transformer | Code | 1
Hierarchical Approach for Joint Semantic, Plant Instance, and Leaf Instance Segmentation in the Agricultural Domain | Code | 1
Hierarchical Aggregation for 3D Instance Segmentation | Code | 1
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | Code | 1
Segmenting Known Objects and Unseen Unknowns without Prior Knowledge | Code | 1
You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding | Code | 1
EM-Paste: EM-guided Cut-Paste with DALL-E Augmentation for Image-level Weakly Supervised Instance Segmentation | Code | 1
On Model Calibration for Long-Tailed Object Detection and Instance Segmentation | Code | 1
On Point Affiliation in Feature Upsampling | Code | 1
iBOT: Image BERT Pre-Training with Online Tokenizer | Code | 1
End-to-End Human Instance Matting | Code | 1
A One Stop 3D Target Reconstruction and multilevel Segmentation Method | Code | 1
OpenVIS: Open-vocabulary Video Instance Segmentation | Code | 1
GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point Cloud | Code | 1
Balanced Meta-Softmax for Long-Tailed Visual Recognition | Code | 1
End-to-End Referring Video Object Segmentation with Multimodal Transformers | Code | 1
End-to-End Semi-Supervised Object Detection with Soft Teacher | Code | 1
Guided Distillation for Semi-Supervised Instance Segmentation | Code | 1
GRIT: General Robust Image Task Benchmark | Code | 1
P2T: Pyramid Pooling Transformer for Scene Understanding | Code | 1
FcaNet: Frequency Channel Attention Networks | Code | 1
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token | Code | 1
H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-H | AP50 | 80.8 | | Unverified
2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 | | Unverified
3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 | | Unverified
4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 | | Unverified
5 | Co-DETR | mask AP | 57.1 | | Unverified
6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 | | Unverified
7 | ISDA (ResNet-50) | APL | 55.7 | | Unverified
8 | EVA | mask AP | 55.5 | | Unverified
9 | FD-SwinV2-G | mask AP | 55.4 | | Unverified
10 | Mask Frozen-DETR | mask AP | 55.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | InternImage-B | GFLOPs | 501 | | Unverified
2 | Co-DETR | mask AP | 56.6 | | Unverified
3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 | | Unverified
4 | InternImage-H | mask AP | 55.4 | | Unverified
5 | EVA | mask AP | 55 | | Unverified
6 | Mask Frozen-DETR | mask AP | 54.9 | | Unverified
7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 | | Unverified
8 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 | | Unverified
9 | GLEE-Pro | mask AP | 54.2 | | Unverified
10 | SwinV2-G (HTC++) | mask AP | 53.7 | | Unverified