SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 301350 of 2262 papers

TitleStatusHype
M18K: A Comprehensive RGB-D Dataset and Benchmark for Mushroom Detection and Instance SegmentationCode0
Part2Object: Hierarchical Unsupervised 3D Instance SegmentationCode1
WSESeg: Introducing a Dataset for the Segmentation of Winter Sports Equipment with a Baseline for Interactive SegmentationCode0
Adaptive Parametric ActivationCode2
SLoRD: Structural Low-Rank Descriptors for Shape Consistency in Vertebrae SegmentationCode0
MambaVision: A Hybrid Mamba-Transformer Vision BackboneCode7
Unified Embedding Alignment for Open-Vocabulary Video Instance SegmentationCode1
Mapping urban large-area advertising structures using drone imagery and deep learning-based spatial data analysisCode0
Joint prototype and coefficient prediction for 3D instance segmentation0
Improved Block Merging for 3D Point Cloud Instance Segmentation0
Training-free CryoET Tomogram SegmentationCode2
Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge0
Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing0
Medical Image Fusion for High-Level Analysis: A Mutual Enhancement Framework for Unaligned PAT and MRICode0
ADFQ-ViT: Activation-Distribution-Friendly Post-Training Quantization for Vision Transformers0
Context-Aware Video Instance SegmentationCode2
Multi-Grained Contrast for Data-Efficient Unsupervised Representation LearningCode1
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction0
Robot Instance Segmentation with Few Annotations for GraspingCode0
PM-VIS+: High-Performance Video Instance Segmentation without Video AnnotationCode0
3D Feature Distillation with Object-Centric Priors0
CoDA: Interactive Segmentation and Morphological Analysis of Dendroid Structures Exemplified on Stony Cold-Water CoralsCode0
Optimization of Autonomous Driving Image Detection Based on RFAConv and Triplet Attention0
XAMI -- A Benchmark Dataset for Artefact Detection in XMM-Newton Optical ImagesCode0
Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation0
Depth-Guided Semi-Supervised Instance Segmentation0
GMT: Guided Mask Transformer for Leaf Instance SegmentationCode0
Instance Consistency Regularization for Semi-Supervised 3D Instance SegmentationCode1
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic SurgeryCode1
Fine-grained Background Representation for Weakly Supervised Semantic SegmentationCode0
TraceNet: Segment one thing efficiently0
2nd Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation0
3D Instance Segmentation Using Deep Learning on RGB-D Indoor Data0
Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines0
OoDIS: Anomaly Instance Segmentation BenchmarkCode1
Benchmarking Label Noise in Instance Segmentation: Spatial Noise MattersCode0
MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor PerceptionCode0
4M-21: An Any-to-Any Vision Model for Tens of Tasks and ModalitiesCode5
2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation0
Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ?Code0
RS-DFM: A Remote Sensing Distributed Foundation Model for Diverse Downstream Tasks0
UVIS: Unsupervised Video Instance Segmentation0
PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving0
Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale DatasetCode2
Scaling Graph Convolutions for Mobile VisionCode1
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation0
Nacala-Roof-Material: Drone Imagery for Roof Detection, Classification, and Segmentation to Support Mosquito-borne Disease Risk Assessment0
Instance Segmentation and Teeth Classification in Panoramic X-raysCode1
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance SegmentationCode3
Generative Active Learning for Long-tailed Instance SegmentationCode2
Show:102550
← PrevPage 7 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8GLEE-Promask AP54.2Unverified
9ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified