SOTAVerified

Instance Segmentation

Instance segmentation is a computer vision task that identifies and separates the individual objects in an image: it detects the boundary of each object and assigns each object a unique label. The goal is a pixel-wise segmentation map of the image in which each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21
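To make the pixel-wise instance map described above concrete, here is a minimal sketch in plain Python. The encoding is an assumption chosen for illustration (0 for background, positive integers for instance IDs), and the helper names `split_instances` and `bounding_box` are hypothetical; they do not come from any of the listed papers or from a particular library.

```python
from collections import defaultdict


def split_instances(instance_map, background=0):
    """Group pixel coordinates by instance ID.

    instance_map: 2D list of ints, one ID per pixel. Assumed encoding:
    0 = background, 1..N = individual object instances.
    Returns a dict {instance_id: [(row, col), ...]}.
    """
    pixels = defaultdict(list)
    for r, row in enumerate(instance_map):
        for c, inst_id in enumerate(row):
            if inst_id != background:
                pixels[inst_id].append((r, c))
    return dict(pixels)


def bounding_box(coords):
    """Tight box (row_min, col_min, row_max, col_max) around a pixel list."""
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols))


# Toy 4x5 image containing two separate instances (IDs 1 and 2).
instance_map = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 2, 2],
    [0, 0, 0, 2, 2],
]

instances = split_instances(instance_map)
for inst_id, coords in sorted(instances.items()):
    # Print the instance ID, its pixel count, and its bounding box.
    print(inst_id, len(coords), bounding_box(coords))
```

Two objects of the same class receive different IDs here, which is what distinguishes an instance map from a semantic segmentation map that stores only a class per pixel.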

Papers

Showing 651-700 of 2262 papers

| Title | Status | Hype |
|---|---|---|
| Exploring Classification Equilibrium in Long-Tailed Object Detection | Code | 1 |
| Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation | Code | 1 |
| Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation | Code | 1 |
| Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation | Code | 1 |
| Effective Self-supervised Pre-training on Low-compute Networks without Distillation | Code | 1 |
| Active Pointly-Supervised Instance Segmentation | Code | 1 |
| Instance Neural Radiance Field | Code | 1 |
| Efficient Connectivity-Preserving Instance Segmentation with Supervoxel-Based Loss Function | Code | 1 |
| Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts | Code | 1 |
| Relational Prior Knowledge Graphs for Detection and Instance Segmentation | Code | 1 |
| Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax | Code | 1 |
| Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments | Code | 1 |
| Deep High-Resolution Representation Learning for Human Pose Estimation | Code | 1 |
| EfficientPS: Efficient Panoptic Segmentation | Code | 1 |
| BARS: A Benchmark for Airport Runway Segmentation | Code | 1 |
| Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction | Code | 1 |
| Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders | Code | 1 |
| Instance Segmentation in the Dark | Code | 1 |
| Instance Segmentation of Biomedical Images with an Object-aware Embedding Learned with Local Constraints | Code | 1 |
| Explain Any Concept: Segment Anything Meets Concept-Based Explanation | Code | 1 |
| 3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation | Code | 1 |
| OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers | Code | 1 |
| Inter-Instance Similarity Modeling for Contrastive Learning | Code | 1 |
| Improving Weakly-supervised Video Instance Segmentation by Leveraging Spatio-temporal Consistency | Code | 1 |
| Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation | Code | 1 |
| Interactive Object Segmentation in 3D Point Clouds | Code | 1 |
| ElC-OIS: Ellipsoidal Clustering for Open-World Instance Segmentation on LiDAR Data | Code | 1 |
| ELSA: Enhanced Local Self-Attention for Vision Transformer | Code | 1 |
| P2T: Pyramid Pooling Transformer for Scene Understanding | Code | 1 |
| Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation | Code | 1 |
| CalibNet: Dual-branch Cross-modal Calibration for RGB-D Salient Instance Segmentation | Code | 1 |
| Balanced Meta-Softmax for Long-Tailed Visual Recognition | Code | 1 |
| ISETHDR: A Physics-based Synthetic Radiance Dataset for High Dynamic Range Driving Scenes | Code | 1 |
| EM-Paste: EM-guided Cut-Paste with DALL-E Augmentation for Image-level Weakly Supervised Instance Segmentation | Code | 1 |
| iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images | Code | 1 |
| ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution | Code | 1 |
| OpenVIS: Open-vocabulary Video Instance Segmentation | Code | 1 |
| End-to-End Human Instance Matting | Code | 1 |
| A One Stop 3D Target Reconstruction and multilevel Segmentation Method | Code | 1 |
| JacobiNeRF: NeRF Shaping with Mutual Information Gradients | Code | 1 |
| Kartezio: Evolutionary Design of Explainable Pipelines for Biomedical Image Analysis | Code | 1 |
| Key Points Estimation and Point Instance Segmentation Approach for Lane Detection | Code | 1 |
| EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention | Code | 1 |
| End-to-End Semi-Supervised Object Detection with Soft Teacher | Code | 1 |
| All in Tokens: Unifying Output Space of Visual Tasks via Soft Token | Code | 1 |
| OpenMaskDINO3D: Reasoning 3D Segmentation via Large Language Model | Code | 1 |
| Robust Instance Segmentation through Reasoning about Multi-Object Occlusion | Code | 1 |
| ROFT: Real-Time Optical Flow-Aided 6D Object Pose and Velocity Tracking | Code | 1 |
| Applying Eigencontours to PolarMask-Based Instance Segmentation | Code | 1 |
| Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling | Code | 1 |

Benchmark Results

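| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-H | AP50 | 80.8 | | Unverified |
| 2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 | | Unverified |
| 3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 | | Unverified |
| 4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 | | Unverified |
| 5 | Co-DETR | mask AP | 57.1 | | Unverified |
| 6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 | | Unverified |
| 7 | ISDA (ResNet-50) | APL | 55.7 | | Unverified |
| 8 | EVA | mask AP | 55.5 | | Unverified |
| 9 | FD-SwinV2-G | mask AP | 55.4 | | Unverified |
| 10 | Mask Frozen-DETR | mask AP | 55.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-B | GFLOPs | 501 | | Unverified |
| 2 | Co-DETR | mask AP | 56.6 | | Unverified |
| 3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 | | Unverified |
| 4 | InternImage-H | mask AP | 55.4 | | Unverified |
| 5 | EVA | mask AP | 55 | | Unverified |
| 6 | Mask Frozen-DETR | mask AP | 54.9 | | Unverified |
| 7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 | | Unverified |
| 8 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 | | Unverified |
| 9 | GLEE-Pro | mask AP | 54.2 | | Unverified |
| 10 | SwinV2-G (HTC++) | mask AP | 53.7 | | Unverified |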