SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 701750 of 2262 papers

TitleStatusHype
Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance SegmentationCode1
PDAM: A Panoptic-Level Feature Alignment Framework for Unsupervised Domain Adaptive Instance Segmentation in Microscopy ImagesCode1
Instance As Identity: A Generic Online Paradigm for Video Instance SegmentationCode1
Test-time Adaptation with Slot-Centric ModelsCode1
3D Part Guided Image Editing for Fine-Grained Object UnderstandingCode1
Instance Segmentation in the DarkCode1
Image Augmentation for Multitask Few-Shot Learning: Agricultural Domain Use-CaseCode1
Balanced Meta-Softmax for Long-Tailed Visual RecognitionCode1
Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing ImagesCode1
Pointly-Supervised Instance SegmentationCode1
iBOT: Image BERT Pre-Training with Online TokenizerCode1
All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenCode1
EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural NetworkCode1
EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention FusionCode1
CBNet: A Composite Backbone Network Architecture for Object DetectionCode1
Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object DetectionCode1
PolyLoss: A Polynomial Expansion Perspective of Classification Loss FunctionsCode1
PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite ImagesCode1
CDNet: Centripetal Direction Network for Nuclear Instance SegmentationCode1
Implicit Feature Refinement for Instance SegmentationCode1
Deep High-Resolution Representation Learning for Visual RecognitionCode1
Evaluation Study on SAM 2 for Class-agnostic Instance-level SegmentationCode1
Panoptic Feature Fusion Net: A Novel Instance Segmentation Paradigm for Biomedical and Biological ImagesCode1
EViT: An Eagle Vision Transformer with Bi-Fovea Self-AttentionCode1
Human Instance Matting via Mutual Guidance and Multi-Instance RefinementCode1
Evolving Normalization-Activation LayersCode1
HS-ResNet: Hierarchical-Split Block on Convolutional Neural NetworkCode1
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningCode1
Explain Any Concept: Segment Anything Meets Concept-Based ExplanationCode1
Instances as QueriesCode1
A Review of Panoptic Segmentation for Mobile Mapping Point CloudsCode1
RapidNet: Multi-Level Dilated Convolution Based Mobile BackboneCode1
Humans need not label more humans: Occlusion Copy & Paste for Occluded Human Instance SegmentationCode1
Exploring Classification Equilibrium in Long-Tailed Object DetectionCode1
CellVTA: Enhancing Vision Foundation Models for Accurate Cell Segmentation and ClassificationCode1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene ContextsCode1
Decoupling Classifier for Boosting Few-shot Object Detection and Instance SegmentationCode1
A Robust Feature Downsampling Module for Remote Sensing Visual TasksCode1
Fashionpedia: Ontology, Segmentation, and an Attribute Localization DatasetCode1
Exploring The Role of Mean Teachers in Self-supervised Masked Auto-EncodersCode1
CenterMask : Real-Time Anchor-Free Instance SegmentationCode1
Regularized Densely-connected Pyramid Network for Salient Instance SegmentationCode1
HoVer-UNet: Accelerating HoVerNet with UNet-based multi-class nuclei segmentation via knowledge distillationCode1
Hybrid Task Cascade for Instance SegmentationCode1
Relieving Long-tailed Instance Segmentation via Pairwise Class BalanceCode1
Efficient Attention: Attention with Linear ComplexitiesCode1
MViTv2: Improved Multiscale Vision Transformers for Classification and DetectionCode1
FAPIS: A Few-shot Anchor-free Part-based Instance SegmenterCode1
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance SegmentationCode1
DCT-Mask: Discrete Cosine Transform Mask Representation for Instance SegmentationCode1
Show:102550
← PrevPage 15 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified