SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 901950 of 2262 papers

TitleStatusHype
FrGNet: A fourier-guided weakly-supervised framework for nuclear instance segmentationCode0
Instance Segmentation of Scene Sketches Using Natural Image Priors0
Generalized Class Discovery in Instance Segmentation0
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer0
AIQViT: Architecture-Informed Post-Training Quantization for Vision Transformers0
Beyond the Final Layer: Hierarchical Query Fusion Transformer with Agent-Interpolation Initialization for 3D Instance Segmentation0
ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation ModelsCode0
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation0
Memory Efficient Transformer Adapter for Dense Predictions0
Lifting by Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation0
INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation0
Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations0
D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation0
Vision Aided Channel Prediction for Vehicular Communications: A Case Study of Received Power Prediction Using RGB Images0
Effective Defect Detection Using Instance Segmentation for NDI0
Foreign object segmentation in chest x-rays through anatomy-guided shape insertion0
ENSeg: A Novel Dataset and Method for the Segmentation of Enteric Neuron Cells on Microscopy ImagesCode0
LiCAR: pseudo-RGB LiDAR image for CAR segmentation0
Data-driven Detection and Evaluation of Damages in Concrete Structures: Using Deep Learning and Computer Vision0
Advancing Oyster Phenotype Segmentation with Multi-Network Ensemble and Multi-Scale mechanism0
ClusterViG: Efficient Globally Aware Vision GNNs via Image Partitioning0
SmartEraser: Remove Anything from Images using Masked-Region Guidance0
Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation0
NextStop: An Improved Tracker For Panoptic LIDAR Segmentation DataCode0
Rapid Automated Mapping of Clouds on Titan With Instance SegmentationCode0
AutoFish: Dataset and Benchmark for Fine-grained Analysis of Fish0
IAM: Enhancing RGB-D Instance Segmentation with New BenchmarksCode0
Dedicated Inference Engine and Binary-Weight Neural Networks for Lightweight Instance Segmentation0
Leverage Cross-Attention for End-to-End Open-Vocabulary Panoptic Reconstruction0
Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature SpaceCode0
Advancing Manga Analysis: Comprehensive Segmentation Annotations for the Manga109 Dataset0
SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation0
PolarNeXt: Rethink Instance Segmentation with Polar Representation0
Semantic and Sequential Alignment for Referring Video Object Segmentation0
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation0
Insightful Instance Features for 3D Instance Segmentation0
DefMamba: Deformable Visual State Space Model0
Decoupled Motion Expression Video Segmentation0
WISH: Weakly Supervised Instance Segmentation using Heterogeneous Labels0
PanoSLAM: Panoptic 3D Scene Reconstruction via Gaussian SLAMCode0
A Novel Shape Guided Transformer Network for Instance Segmentation in Remote Sensing Images0
Progressive Fine-to-Coarse Reconstruction for Accurate Low-Bit Post-Training Quantization in Vision Transformers0
PyPotteryLens: An Open-Source Deep Learning Framework for Automated Digitisation of Archaeological Pottery DocumentationCode0
Classification Drives Geographic Bias in Street Scene Segmentation0
SAM-IF: Leveraging SAM for Incremental Few-Shot Instance Segmentation0
STEAM: Squeeze and Transform Enhanced Attention Module0
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework0
Integrating YOLO11 and Convolution Block Attention Module for Multi-Season Segmentation of Tree Trunks and Branches in Commercial Apple Orchards0
Towards Real-Time Open-Vocabulary Video Instance SegmentationCode0
Vision Transformers for Weakly-Supervised Microorganism EnumerationCode0
Show:102550
← PrevPage 19 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified