SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 801825 of 2262 papers

TitleStatusHype
MID-Fusion: Octree-based Object-Level Multi-Instance Dynamic SLAMCode1
GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point CloudCode1
Efficient Attention: Attention with Linear ComplexitiesCode1
One-Shot Instance SegmentationCode1
Deformable ConvNets v2: More Deformable, Better ResultsCode1
Weakly- and Semi-Supervised Panoptic SegmentationCode1
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask LearningCode1
The ApolloScape Open Dataset for Autonomous Driving and its ApplicationCode1
Path Aggregation Network for Instance SegmentationCode1
Multiclass Weighted Loss for Instance Segmentation of Cluttered CellsCode1
Panoptic SegmentationCode1
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANsCode1
Non-local Neural NetworksCode1
Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and SemanticsCode1
Mask R-CNNCode1
Microsoft COCO: Common Objects in ContextCode1
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance SegmentationCode0
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping0
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation0
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning0
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation0
NOCTIS: Novel Object Cyclic Threshold based Instance SegmentationCode0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment0
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects0
Show:102550
← PrevPage 33 of 91Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8GLEE-Promask AP54.2Unverified
9ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified