SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 150 of 2262 papers

TitleStatusHype
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance SegmentationCode0
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping1
SPADE: Spatial-Aware Denoising Network for Open-vocabulary Panoptic Scene Graph Generation with Long- and Local-range Context Reasoning0
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation0
Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation0
No time to train! Training-Free Reference-Based Instance SegmentationCode3
NOCTIS: Novel Object Cyclic Threshold based Instance SegmentationCode0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment0
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects0
Prohibited Items Segmentation via Occlusion-aware Bilayer ModelingCode0
ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation0
The Four Color Theorem for Cell Instance SegmentationCode1
Accurate and efficient zero-shot 6D pose estimation with frozen foundation models0
SAM2Auto: Auto Annotation Using FLASH0
OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting0
You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping0
Bringing SAM to new heights: Leveraging elevation data for tree crown segmentation from drone imagery0
OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language ModelCode1
CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx0
Gen-n-Val: Agentic Image Data Generation and Validation0
SPPSFormer: High-quality Superpoint-based Transformer for Roof Plane Instance Segmentation from Point Clouds0
CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation0
ConfLUNet: Multiple sclerosis lesion instance segmentation in presence of confluent lesions0
The Missing Point in Vision Transformers for Universal Image SegmentationCode2
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of ThoughtsCode0
Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting0
Sketchy Bounding-box Supervision for 3D Instance SegmentationCode1
RE-TRIP : Reflectivity Instance Augmented Triangle Descriptor for 3D Place RecognitionCode1
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark DatasetCode1
gen2seg: Generative Models Enable Generalizable Instance Segmentation0
Decoupling Classifier for Boosting Few-shot Object Detection and Instance SegmentationCode1
Instance Segmentation for Point Sets0
Industrial Synthetic Segment Pre-training0
FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching0
Enhancing Transformers Through Conditioned Embedded Tokens0
SoftPQ: Robust Instance Segmentation Evaluation via Soft Matching and Tunable ThresholdsCode0
SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision0
Pseudo-Label Quality Decoupling and Correction for Semi-Supervised Instance Segmentation0
The RaspGrade Dataset: Towards Automatic Raspberry Ripeness Grading with Deep Learning0
Vision Foundation Model Embedding-Based Semantic Anomaly Detection0
Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model0
RepSNet: A Nucleus Instance Segmentation model based on Boundary Regression and Structural Re-parameterization0
Hyb-KAN ViT: Hybrid Kolmogorov-Arnold Networks Augmented Vision Transformer0
Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach0
Segment Any RGB-Thermal Model with Language-aided Distillation0
A Novel WaveInst-based Network for Tree Trunk Structure Extraction and Pattern Analysis in Forest Inventory0
Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing0
MoSAM: Motion-Guided Segment Anything Model with Spatial-Temporal Memory Selection0
OG-HFYOLO :Orientation gradient guidance and heterogeneous feature fusion for deformation table cell instance segmentationCode1
Show:102550
← PrevPage 1 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified