SOTAVerified

Instance Segmentation

Instance segmentation is a computer vision task that identifies and separates the individual objects in an image: it detects the boundary of each object and assigns each object a unique label. The goal is a pixel-wise segmentation map in which every pixel that belongs to an object is assigned to one specific object instance.
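As a rough illustration of what such a pixel-wise map looks like, the sketch below uses a small hand-written grid of integer instance IDs and splits it into one binary mask per object. The grid, the convention that 0 means background, and the helper names `split_instances` and `bounding_box` are assumptions made for this example; they are not taken from any particular dataset or library.

```python
# Hypothetical 4x6 instance map: 0 is background, 1 and 2 are two object instances.
INSTANCE_MAP = [
    [0, 1, 1, 0, 0, 0],
    [0, 1, 1, 0, 2, 2],
    [0, 0, 0, 0, 2, 2],
    [0, 0, 0, 0, 2, 0],
]


def split_instances(instance_map, background=0):
    """Return {instance_id: binary mask} for every non-background ID."""
    height, width = len(instance_map), len(instance_map[0])
    masks = {}
    for y, row in enumerate(instance_map):
        for x, inst in enumerate(row):
            if inst == background:
                continue
            if inst not in masks:
                # First pixel seen for this instance: start an all-zero mask.
                masks[inst] = [[0] * width for _ in range(height)]
            masks[inst][y][x] = 1
    return masks


def bounding_box(mask):
    """Tight (x_min, y_min, x_max, y_max) box around the 1-pixels, inclusive."""
    ys = [y for y, row in enumerate(mask) if any(row)]
    xs = [x for row in mask for x, value in enumerate(row) if value]
    return (min(xs), min(ys), max(xs), max(ys))


masks = split_instances(INSTANCE_MAP)
for inst, mask in sorted(masks.items()):
    area = sum(map(sum, mask))  # number of pixels assigned to this instance
    print(inst, area, bounding_box(mask))
```

Each instance gets its own mask even when two objects share a class, which is the property that separates instance segmentation from plain semantic segmentation.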

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Papers

Showing 1551-1600 of 2262 papers

| Title | Status | Hype |
|---|---|---|
| Learning to Track Instances without Video Annotations | | 0 |
| The surprising impact of mask-head architecture on novel class segmentation | Code | 0 |
| FAPIS: A Few-shot Anchor-free Part-based Instance Segmenter | Code | 1 |
| Scale-aware Automatic Augmentation for Object Detection | Code | 1 |
| Using depth information and colour space variations for improving outdoor robustness for instance segmentation of cabbage | | 0 |
| Camouflaged Instance Segmentation In-The-Wild: Dataset, Method, and Benchmark Suite | | 0 |
| SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks | | 0 |
| Assessing YOLACT++ for real time and robust instance segmentation of medical instruments in endoscopic procedures | | 0 |
| Distribution Alignment: A Unified Framework for Long-tail Visual Recognition | Code | 1 |
| PlaneSegNet: Fast and Robust Plane Estimation Using a Single-stage Instance Segmentation CNN | | 0 |
| Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding | Code | 1 |
| Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation | Code | 1 |
| Instance segmentation with the number of clusters incorporated in embedding learning | | 0 |
| Sparse Object-level Supervision for Instance Segmentation with Pixel Embeddings | Code | 1 |
| Video Instance Segmentation with a Propose-Reduce Paradigm | Code | 1 |
| Swin Transformer: Hierarchical Vision Transformer using Shifted Windows | Code | 2 |
| Region Similarity Representation Learning | Code | 1 |
| Dilated SpineNet for Semantic Segmentation | | 0 |
| Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency | | 0 |
| Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers | Code | 1 |
| Scaling Local Self-Attention for Parameter Efficient Visual Backbones | Code | 1 |
| Human De-occlusion: Invisible Perception and Recovery for Humans | | 0 |
| Video Class Agnostic Segmentation Benchmark for Autonomous Driving | Code | 1 |
| Generic Perceptual Loss for Modeling Structured Output Dependencies | | 0 |
| SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation | Code | 1 |
| LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation | Code | 1 |
| Track to Detect and Segment: An Online Multi-Object Tracker | Code | 1 |
| BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation | Code | 1 |
| Robust 2D/3D Vehicle Parsing in CVIS | | 0 |
| Instance Segmentation GNNs for One-Shot Conformal Tracking at the LHC | | 0 |
| Unknown Object Segmentation from Stereo Images | Code | 1 |
| Spatially Consistent Representation Learning | Code | 1 |
| Quality-Aware Network for Human Parsing | Code | 1 |
| Panoptic Lintention Network: Towards Efficient Navigational Perception for the Visually Impaired | Code | 0 |
| Cross-View Regularization for Domain Adaptive Panoptic Segmentation | Code | 1 |
| InstantDL-An easy-to-use deep learning pipeline for image segmentation and classification | Code | 1 |
| Simple multi-dataset detection | Code | 1 |
| FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation | Code | 1 |
| SCD: A Stacked Carton Dataset for Detection and Segmentation | Code | 0 |
| Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions | Code | 1 |
| Image Augmentation for Multitask Few-Shot Learning: Agricultural Domain Use-Case | Code | 1 |
| SALT: A Semi-automatic Labeling Tool for RGB-D Video Sequences | | 0 |
| Contour Loss for Instance Segmentation via k-step Distance Transformation Image | | 0 |
| CellTrack R-CNN: A Novel End-To-End Deep Neural Network for Cell Segmentation and Tracking in Microscopy Images | Code | 0 |
| One Shot Model For COVID-19 Classification and Lesions Segmentation In Chest CT Scans Using LSTM With Attention Mechanism | Code | 0 |
| One Shot Model For COVID-19 Classification and Lesions Segmentation In Chest CT Scans Using LSTM With Attention Mechanism | Code | 0 |
| LambdaNetworks: Modeling Long-Range Interactions Without Attention | Code | 2 |
| MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection | Code | 1 |
| EfficientLPS: Efficient LiDAR Panoptic Segmentation | | 0 |
| Building outline delineation: From aerial images to polygons with an improved end-to-end learning framework | | 0 |
Page 32 of 46

Benchmark Results

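| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-H | AP50 | 80.8 | | Unverified |
| 2 | ResNeSt-200 (multi-scale) | AP50 | 70.2 | | Unverified |
| 3 | CenterMask + VoVNetV2-99 (multi-scale) | AP50 | 66.2 | | Unverified |
| 4 | CenterMask + VoVNetV2-57 (single-scale) | AP50 | 60.8 | | Unverified |
| 5 | Co-DETR | mask AP | 57.1 | | Unverified |
| 6 | CBNetV2 (EVA02, single-scale) | mask AP | 56.1 | | Unverified |
| 7 | ISDA (ResNet-50) | APL | 55.7 | | Unverified |
| 8 | EVA | mask AP | 55.5 | | Unverified |
| 9 | FD-SwinV2-G | mask AP | 55.4 | | Unverified |
| 10 | Mask Frozen-DETR | mask AP | 55.3 | | Unverified |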
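
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | InternImage-B | GFLOPs | 501 | | Unverified |
| 2 | Co-DETR | mask AP | 56.6 | | Unverified |
| 3 | ViT-CoMer-L (Mask RCNN, DINOv2) | mask AP | 55.9 | | Unverified |
| 4 | InternImage-H | mask AP | 55.4 | | Unverified |
| 5 | EVA | mask AP | 55 | | Unverified |
| 6 | Mask Frozen-DETR | mask AP | 54.9 | | Unverified |
| 7 | Mask DINO (SwinL, multi-scale) | mask AP | 54.5 | | Unverified |
| 8 | ViT-Adapter-L (HTC++, BEiTv2, O365, multi-scale) | mask AP | 54.2 | | Unverified |
| 9 | GLEE-Pro | mask AP | 54.2 | | Unverified |
| 10 | SwinV2-G (HTC++) | mask AP | 53.7 | | Unverified |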