SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 251300 of 2262 papers

TitleStatusHype
AISFormer: Amodal Instance Segmentation with TransformerCode1
Distilling Knowledge via Knowledge ReviewCode1
Distribution Alignment: A Unified Framework for Long-tail Visual RecognitionCode1
Contextual Transformer Networks for Visual RecognitionCode1
3D Part Guided Image Editing for Fine-Grained Object UnderstandingCode1
Continual Learning for Image Segmentation with Dynamic QueryCode1
Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive FusionCode1
Crossover Learning for Fast Online Video Instance SegmentationCode1
FAPIS: A Few-shot Anchor-free Part-based Instance SegmenterCode1
Applying Eigencontours to PolarMask-Based Instance SegmentationCode1
AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm QuantizerCode1
Container: Context Aggregation NetworkCode1
Conformer: Local Features Coupling Global Representations for Visual RecognitionCode1
A One Stop 3D Target Reconstruction and multilevel Segmentation MethodCode1
Container: Context Aggregation NetworksCode1
Explain Any Concept: Segment Anything Meets Concept-Based ExplanationCode1
AnyStar: Domain randomized universal star-convex 3D instance segmentationCode1
Evolving Normalization-Activation LayersCode1
Conditional Convolutions for Instance SegmentationCode1
3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance SegmentationCode1
Conditional Object-Centric Learning from VideoCode1
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningCode1
Exploring Classification Equilibrium in Long-Tailed Object DetectionCode1
CompFeat: Comprehensive Feature Aggregation for Video Instance SegmentationCode1
UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place SolutionCode1
Complete Instances Mining for Weakly Supervised Instance SegmentationCode1
Common Limitations of Image Processing Metrics: A Picture StoryCode1
Active Pointly-Supervised Instance SegmentationCode1
Evaluation Study on SAM 2 for Class-agnostic Instance-level SegmentationCode1
Compositional Human-Scene Interaction Synthesis with Semantic ControlCode1
Active Token MixerCode1
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic SegmentationCode1
Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable ApproachCode1
3D Mitochondria Instance Segmentation with Spatio-Temporal TransformersCode1
Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance SegmentationCode1
Cross-View Regularization for Domain Adaptive Panoptic SegmentationCode1
EViT: An Eagle Vision Transformer with Bi-Fovea Self-AttentionCode1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene ContextsCode1
FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance SegmentationCode1
CLUSTSEG: Clustering for Universal SegmentationCode1
End-to-End Semi-Supervised Object Detection with Soft TeacherCode1
End-to-End Video Instance Segmentation with TransformersCode1
HCFormer: Unified Image Segmentation with Hierarchical ClusteringCode1
Clustering Plotted Data by Image SegmentationCode1
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance SegmentationCode1
An Instance Segmentation Dataset of Yeast Cells in MicrostructuresCode1
A Comparative Evaluation of Deep Learning Techniques for Photovoltaic Panel Detection from Aerial ImagesCode1
Coherent Reconstruction of Multiple Humans from a Single ImageCode1
ClusterFormer: Clustering As A Universal Visual LearnerCode1
Show:102550
← PrevPage 6 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified