SOTAVerified

Instance Segmentation

Instance segmentation is a computer vision task that identifies and separates the individual objects in an image: it detects the boundary of each object and assigns each object a unique label. The goal is a pixel-wise segmentation map of the image in which each object pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21
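The pixel-wise map described above can be pictured as a grid of integer instance IDs. The following is a minimal sketch, assuming the convention that 0 marks background and each positive integer marks one object instance; the toy grid and the helper names (`split_instances`, `mask_area`, `bounding_box`) are hypothetical and do not come from any paper listed here.

```python
from collections import defaultdict

# Hypothetical 4x5 instance map: 0 is background, every positive integer is
# the ID of one object instance (values are illustrative, not dataset labels).
instance_map = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 2, 2],
    [3, 3, 0, 2, 2],
]


def split_instances(id_map, background=0):
    """Return {instance_id: binary mask} from a pixel-wise instance ID map."""
    height, width = len(id_map), len(id_map[0])
    masks = defaultdict(lambda: [[0] * width for _ in range(height)])
    for y, row in enumerate(id_map):
        for x, instance_id in enumerate(row):
            if instance_id != background:
                masks[instance_id][y][x] = 1
    return dict(masks)


def mask_area(mask):
    """Number of pixels that belong to the instance."""
    return sum(sum(row) for row in mask)


def bounding_box(mask):
    """Tight box (x_min, y_min, x_max, y_max), inclusive, around a non-empty mask."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, value in enumerate(row) if value]
    return (min(xs), min(ys), max(xs), max(ys))


masks = split_instances(instance_map)
for instance_id in sorted(masks):
    mask = masks[instance_id]
    print(instance_id, mask_area(mask), bounding_box(mask))
```

Each instance gets its own binary mask, which is what separates instance segmentation from a per-class labeling: two objects of the same class would still receive different IDs.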

Papers

Showing 701–750 of 2262 papers

| Title | Status | Hype |
| --- | --- | --- |
| Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance Segmentation | Code | 1 |
| Learning Saliency Propagation for Semi-Supervised Instance Segmentation | Code | 1 |
| FinnWoodlands Dataset | Code | 1 |
| Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data Augmentation | Code | 1 |
| 3D Part Guided Image Editing for Fine-Grained Object Understanding | Code | 1 |
| Learning with Noisy Class Labels for Instance Segmentation | Code | 1 |
| Fully Sparse Fusion for 3D Object Detection | Code | 1 |
| Less is More: Pay Less Attention in Vision Transformers | Code | 1 |
| Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images | Code | 1 |
| Let-It-Flow: Simultaneous Optimization of 3D Flow and Object Clustering | Code | 1 |
| Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation | Code | 1 |
| PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond | Code | 1 |
| EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural Network | Code | 1 |
| EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion | Code | 1 |
| CBNet: A Composite Backbone Network Architecture for Object Detection | Code | 1 |
| Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection | Code | 1 |
| Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation | Code | 1 |
| Lidar Panoptic Segmentation in an Open World | Code | 1 |
| CDNet: Centripetal Direction Network for Nuclear Instance Segmentation | Code | 1 |
| LIVECell—A large-scale dataset for label-free live cell segmentation | Code | 1 |
| Deep High-Resolution Representation Learning for Visual Recognition | Code | 1 |
| Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation | Code | 1 |
| Panoptic Feature Fusion Net: A Novel Instance Segmentation Paradigm for Biomedical and Biological Images | Code | 1 |
| EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention | Code | 1 |
| Locally Enhanced Self-Attention: Combining Self-Attention and Convolution as Local and Context Terms | Code | 1 |
| Evolving Normalization-Activation Layers | Code | 1 |
| Balanced Meta-Softmax for Long-Tailed Visual Recognition | Code | 1 |
| Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning | Code | 1 |
| PlaneRecNet: Multi-Task Learning with Cross-Task Consistency for Piece-Wise Plane Detection and Reconstruction from a Single RGB Image | Code | 1 |
| Low Latency Instance Segmentation by Continuous Clustering for LiDAR Sensors | Code | 1 |
| A Review of Panoptic Segmentation for Mobile Mapping Point Clouds | Code | 1 |
| Long-tailed Instance Segmentation using Gumbel Optimized Loss | Code | 1 |
| FcaNet: Frequency Channel Attention Networks | Code | 1 |
| Exploring Classification Equilibrium in Long-Tailed Object Detection | Code | 1 |
| All in Tokens: Unifying Output Space of Visual Tasks via Soft Token | Code | 1 |
| Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts | Code | 1 |
| PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation | Code | 1 |
| A Robust Feature Downsampling Module for Remote Sensing Visual Tasks | Code | 1 |
| Making Vision Transformers Efficient from A Token Sparsification View | Code | 1 |
| Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders | Code | 1 |
| CenterMask : Real-Time Anchor-Free Instance Segmentation | Code | 1 |
| M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis | Code | 1 |
| PIDray: A Large-scale X-ray Benchmark for Real-World Prohibited Item Detection | Code | 1 |
| CenterMask: Real-Time Anchor-Free Instance Segmentation | Code | 1 |
| Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Code | 1 |
| Efficient Attention: Attention with Linear Complexities | Code | 1 |
| FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras | Code | 1 |
| Mask Transfiner for High-Quality Instance Segmentation | Code | 1 |
| FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation | Code | 1 |
| Plane Geometry Diagram Parsing | Code | 1 |

Benchmark Results

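| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternImage-H | AP50 | 80.8 |  | Unverified |
| 2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 |  | Unverified |
| 3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 |  | Unverified |
| 4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 |  | Unverified |
| 5 | Co-DETR | mask AP | 57.1 |  | Unverified |
| 6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 |  | Unverified |
| 7 | ISDA (ResNet-50) | APL | 55.7 |  | Unverified |
| 8 | EVA | mask AP | 55.5 |  | Unverified |
| 9 | FD-SwinV2-G | mask AP | 55.4 |  | Unverified |
| 10 | Mask Frozen-DETR | mask AP | 55.3 |  | Unverified |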
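
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | InternImage-B | GFLOPs | 501 |  | Unverified |
| 2 | Co-DETR | mask AP | 56.6 |  | Unverified |
| 3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 |  | Unverified |
| 4 | InternImage-H | mask AP | 55.4 |  | Unverified |
| 5 | EVA | mask AP | 55 |  | Unverified |
| 6 | Mask Frozen-DETR | mask AP | 54.9 |  | Unverified |
| 7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 |  | Unverified |
| 8 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 |  | Unverified |
| 9 | GLEE-Pro | mask AP | 54.2 |  | Unverified |
| 10 | SwinV2-G (HTC++) | mask AP | 53.7 |  | Unverified |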
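
The mask AP and AP50 figures above rest on intersection-over-union (IoU) between a predicted mask and a ground-truth mask. This page does not name the evaluation protocol, so the following is only a sketch that assumes COCO-style conventions: AP50 counts a prediction as a match when IoU is at least 0.5, and mask AP averages over IoU thresholds from 0.50 to 0.95 in steps of 0.05. The toy masks and the `mask_iou` helper are hypothetical, and the sketch shows only the matching step, not the full precision-recall computation.

```python
def mask_iou(mask_a, mask_b):
    """Intersection-over-union of two equally sized binary masks."""
    intersection = 0
    union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for a, b in zip(row_a, row_b):
            if a and b:
                intersection += 1
            if a or b:
                union += 1
    return intersection / union if union else 0.0


# Hypothetical ground-truth and predicted masks for a single object.
ground_truth = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]
prediction = [
    [1, 1, 1, 0],
    [1, 1, 1, 0],
    [0, 0, 0, 0],
]

iou = mask_iou(prediction, ground_truth)  # 4 shared pixels over 6 in the union

# Assumed COCO-style thresholds: 0.50, 0.55, ..., 0.95.
thresholds = [t / 100 for t in range(50, 100, 5)]
matched_thresholds = [t for t in thresholds if iou >= t]

is_match_at_50 = iou >= 0.5   # the criterion behind AP50 under this assumption
is_match_at_75 = iou >= 0.75  # a stricter threshold rejects the same prediction

print(round(iou, 3), is_match_at_50, is_match_at_75, len(matched_thresholds))
```

A prediction that passes at 0.5 can still fail at stricter thresholds, which is one reason a model's AP50 is typically higher than its mask AP on the same data.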