SOTAVerified

Instance Segmentation

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 11011150 of 2262 papers

TitleStatusHype
How Shift Equivariance Impacts Metric Learning for Instance SegmentationCode0
A Dataset for Analysing Complex Document Layouts in the Digital Humanities and Its Evaluation with Krippendorff’s AlphaCode0
An expert-driven data generation pipeline for histological imagesCode0
XAMI -- A Benchmark Dataset for Artefact Detection in XMM-Newton Optical ImagesCode0
Towards End-to-End Lane Detection: an Instance Segmentation ApproachCode0
DiT: Efficient Vision Transformers with Dynamic Token RoutingCode0
A 3D Convolutional Approach to Spectral Object Segmentation in Space and TimeCode0
Learning Panoptic Segmentation from Instance ContoursCode0
Instance Segmentation of Biological Images Using Harmonic EmbeddingsCode0
One Shot Model For COVID-19 Classification and Lesions Segmentation In Chest CT Scans Using LSTM With Attention MechanismCode0
D-InLoc++: Indoor Localization in Dynamic EnvironmentsCode0
A Feasible Framework for Arbitrary-Shaped Scene Text RecognitionCode0
Learning Regional Purity for Instance Segmentation on 3D Point CloudsCode0
Joint Representation Learning for Text and 3D Point CloudCode0
SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance SegmentationCode0
Deeply Shape-guided Cascade for Instance SegmentationCode0
Towards Partial Supervision for Generic Object Counting in Natural ScenesCode0
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance SegmentationCode0
Towards Real-Time Open-Vocabulary Video Instance SegmentationCode0
Vision-based Robotic Grasping From Object Localization, Object Pose Estimation to Grasp Estimation for Parallel Grippers: A ReviewCode0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and TrackingCode0
Towards Segmenting Anything That MovesCode0
Yeast cell segmentation in microstructured environments with deep learningCode0
IAM: Enhancing RGB-D Instance Segmentation with New BenchmarksCode0
Efficient Temporal Action Segmentation via Boundary-aware Query VotingCode0
Bimodal SegNet: Instance Segmentation Fusing Events and RGB Frames for Robotic GraspingCode0
One Shot Model For COVID-19 Classification and Lesions Segmentation In Chest CT Scans Using LSTM With Attention MechanismCode0
ICDAR 2021 Competition on Historical Map SegmentationCode0
Signature and Log-signature for the Study of Empirical Distributions Generated with GANsCode0
SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video DecompositionCode0
DAN-NucNet: A dual attention based framework for nuclei segmentation in cancer histology images under wild clinical conditionsCode0
A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband ItemsCode0
Monocular Depth Estimation Using Cues Inspired by Biological Vision SystemsCode0
Tracking Passengers and Baggage Items using Multiple Overhead Cameras at Security CheckpointsCode0
Enhanced Masked Image Modeling for Analysis of Dental Panoramic RadiographsCode0
YOLACT++: Better Real-time Instance SegmentationCode0
Benchmarking Label Noise in Instance Segmentation: Spatial Noise MattersCode0
Enforcing Morphological Information in Fully Convolutional Networks to Improve Cell Instance Segmentation in Fluorescence Microscopy ImagesCode0
iFS-RCNN: An Incremental Few-shot Instance SegmenterCode0
Simultaneous Semantic and Instance Segmentation for Colon Nuclei Identification and CountingCode0
SimVODIS: Simultaneous Visual Odometry, Object Detection, and Instance SegmentationCode0
Single-Image Piece-wise Planar 3D Reconstruction via Associative EmbeddingCode0
Single Network Panoptic Segmentation for Street Scene UnderstandingCode0
IM2HEIGHT: Height Estimation from Single Monocular Imagery via Fully Residual Convolutional-Deconvolutional NetworkCode0
Single-Shot Lightweight Model For The Detection of Lesions And The Prediction of COVID-19 From Chest CT ScansCode0
Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language TasksCode0
Vision Transformers for Weakly-Supervised Microorganism EnumerationCode0
Image-based Detection of Surface Defects in Concrete during ConstructionCode0
Single-Stage Open-world Instance Segmentation with Cross-task Consistency RegularizationCode0
Trainable Structure Tensors for Autonomous Baggage Threat Detection Under Extreme OcclusionCode0
Show:102550
← PrevPage 23 of 46Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternImage-HAP5080.8Unverified
2ResNeSt-200 (multi-scale)AP5070.2Unverified
3CenterMask + VoVNetV2-99 (multi-scale)AP5066.2Unverified
4CenterMask + VoVNetV2-57 (single-scale)AP5060.8Unverified
5Co-DETRmask AP57.1Unverified
6CBNetV2 (EVA02, single-scale)mask AP56.1Unverified
7ISDA (ResNet-50)APL55.7Unverified
8EVAmask AP55.5Unverified
9FD-SwinV2-Gmask AP55.4Unverified
10Mask Frozen-DETRmask AP55.3Unverified
#ModelMetricClaimedVerifiedStatus
1InternImage-BGFLOPs501Unverified
2Co-DETRmask AP56.6Unverified
3ViT-CoMer-L (Mask RCNN, DINOv2)mask AP55.9Unverified
4InternImage-Hmask AP55.4Unverified
5EVAmask AP55Unverified
6Mask Frozen-DETRmask AP54.9Unverified
7MasK DINO (SwinL, multi-scale)mask AP54.5Unverified
8ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale)mask AP54.2Unverified
9GLEE-Promask AP54.2Unverified
10SwinV2-G (HTC++)mask AP53.7Unverified