SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 251300 of 2262 papers

TitleStatusHype
Continuous Copy-Paste for One-Stage Multi-Object Tracking and SegmentationCode1
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise BinarizationCode1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerCode1
ContourFormer:Real-Time Contour-Based End-to-End Instance Segmentation TransformerCode1
3D Part Guided Image Editing for Fine-Grained Object UnderstandingCode1
Continual Learning for Image Segmentation with Dynamic QueryCode1
Contour Proposal Networks for Biomedical Instance SegmentationCode1
ConvMLP: Hierarchical Convolutional MLPs for VisionCode1
Fashionpedia: Ontology, Segmentation, and an Attribute Localization DatasetCode1
Applying Eigencontours to PolarMask-Based Instance SegmentationCode1
AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm QuantizerCode1
Container: Context Aggregation NetworksCode1
A One Stop 3D Target Reconstruction and multilevel Segmentation MethodCode1
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic SegmentationCode1
Conformer: Local Features Coupling Global Representations for Visual RecognitionCode1
AnyStar: Domain randomized universal star-convex 3D instance segmentationCode1
Conditional Object-Centric Learning from VideoCode1
3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance SegmentationCode1
Container: Context Aggregation NetworkCode1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene ContextsCode1
Exploring The Role of Mean Teachers in Self-supervised Masked Auto-EncodersCode1
UVO Challenge on Video-based Open-World Segmentation 2021: 1st Place SolutionCode1
Compositional Human-Scene Interaction Synthesis with Semantic ControlCode1
Complete Instances Mining for Weakly Supervised Instance SegmentationCode1
Active Pointly-Supervised Instance SegmentationCode1
Conditional Convolutions for Instance SegmentationCode1
Explain Any Concept: Segment Anything Meets Concept-Based ExplanationCode1
CompFeat: Comprehensive Feature Aggregation for Video Instance SegmentationCode1
Active Token MixerCode1
Contextual Transformer Networks for Visual RecognitionCode1
Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance SegmentationCode1
3D Mitochondria Instance Segmentation with Spatio-Temporal TransformersCode1
Common Limitations of Image Processing Metrics: A Picture StoryCode1
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningCode1
Exploring Classification Equilibrium in Long-Tailed Object DetectionCode1
Detection and Segmentation of Custom Objects using High Distraction Photorealistic Synthetic DataCode1
Fast and Efficient Transformer-based Method for Bird's Eye View Instance PredictionCode1
EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural NetworkCode1
Coherent Reconstruction of Multiple Humans from a Single ImageCode1
EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention FusionCode1
CLUSTSEG: Clustering for Universal SegmentationCode1
End-to-End Video Instance Segmentation with TransformersCode1
Enhancing Geometric Factors in Model Learning and Inference for Object Detection and Instance SegmentationCode1
Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object DetectionCode1
An Instance Segmentation Dataset of Yeast Cells in MicrostructuresCode1
A Comparative Evaluation of Deep Learning Techniques for Photovoltaic Panel Detection from Aerial ImagesCode1
Clustering Plotted Data by Image SegmentationCode1
HCFormer: Unified Image Segmentation with Hierarchical ClusteringCode1
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
End-to-End Semi-Supervised Object Detection with Soft TeacherCode1
Show:102550
← PrevPage 6 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8GLEE-Promask AP54.2Unverified
9ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified